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SEOUENCE-SPECTPTC GLYCOCOWJUCATE TR ANSCRTPTIONAL ANTAGONISTS 

( 

5 This invention was made in the course of work supported 

by the U.S. Government and Howsurd Hughes Medical Institute, which 
may have certain rights in this invention. 

FIEH? 0? Tm IWfWVXO^ 

10 The invention provides compositions and methods for 

modulating transcriptional activity of specific genes in 
eukaryotic cells by selectively inhibiting binding interactions 
between DNA-binding proteins (e«g., transcription factors) and 
their DNA recognition sites. The invention encos^asses novel 

15 ONA-binding glycoconjugates, pharmaceutical and biochemical 
reagent compositions thereof, methods for modulating 
transcriptional activity of predetermined gene sequences, methods 
for treating or preventing pathological conditions by modulating 
gene expression, methods for -suppressing T cell activation to 

20 provide therapeutic or -prophylactic immunosuppression, and 
methods for identifying novel glycoconjugates that are sequence- 
specific transcriptional antagonists. 

BACKGROUND 

25 Many pathological conditions and genetic diseases, 

including but not limited to neoplasia, autoimmune processes, 
infectious diseases, and the like could be treated or prevented 
if there were methods and compositions to facilitate the directed 
modulation of the transcriptional activity of a predetermined 

30 gene (or genes) in cells. Several approaches to modulating gene 
expression have been investigated, with variable success. 

Transgenesis, or transferring an exogenous expression 
gene cassette into a cell, has been used for placing an exogenous 
gene linked to a predetermined transcriptional control sequence 

35 into a cell or a non-human animal embiryo. Transgenesis has been 
used in many applications, including human in vivo and e3g vivo 
gene therapy, making transgenic animals for commercial and 
academic research, and the llke« Homologous gene targeting has 



H 
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als been enpl yed to make so*-called "knockout" animals r having 
functionally disrupted predeterained genes (Jasin and Berg (1988) 
gen^g and DgyglQffffignt Z: 1353; Ooetschman et al. (1988) Proc. 
Na-bl. Acad, Scl. m.S.A.) 8583; Dorini et al. (1989) Science 

5 212: 1357; Itzhaki and Porter (1991) Nucleic Acidg Res. 19 ; 3835; 

Valancius and Smithies (1991) Mol. Cell. Biol, 11: 1402; Koller ^ 
et al. (1991) Proc, Natl. Acad. Sci. (U.S.A. ^ S&: 10730 and 
Snouwaert et al. (1992) Science 257 : 1083). However , 
inactivating gene function by homologous gene targeting, is 
10 difficult and generally leads to a complete ablation of -gene 
function. 

Antisense oligonucleotides have also been proposed as 
"code blockers" that will inhibit the expression of specific 
genes on the basis of sequence-specific binding,, either to 

15 inhibit translation, transcription, or both (Ching et al. (1989) 
Proc. Natl. Acad. Sci. U.S.A. 86 : 10006; Broder et al. (1990) 
Ann. Int. Med . Xlli 604; Loreau et al. (1990) FEBS Letters 274 : 
53; Holcenberg et al. W091/11535; Cooney et al. (1988) Science 
241 : 456; Antisense RNA and DNA ,. (1988), D.A. Melton, Bd. , Cold 

20 Spring Harbor Laboratory, Cold Spring Harbor, NY) . One mechanism 
by which sense or antisense oligonucleotides may function is via 
formation of triplex or quadruplex DNA structures in a 
corresponding chromosomal gene locus (Cheng et al. (1988) ^ 
Biol. Chem. 263 : 15110; Ferrin and Camerini-Otero (1991) Science 

25 354: 1494; Ramdas et al. (1989) J. Biol. Chem. 264 : 17395; 
Strobel et al. (1991) Science 254 : 1639; Rigas et al. (1986) 
Proc. Natl. Acad. Sci. fU.S.A.^ 83 : 9591; Camerini-Otero et al. 
U.S. 7,611,268; W093/05178; Beal PA and Dervan PB (1991) Science 
251 ; 1360) . Antisense polynucleotides theoretically have the 

30 advantage of producing reversible effects and of producing 
titratable dose-response relationships, albeit there is little 
experimental evidence demonstrating these properties. 

I Both approaches to controlling gene expression, 
transgenesis/homologous recombination and antisense/ triplex 

35 rec[uire transferring exogenous polynucleotide sequences into 
cells, which is problematic, especially for pharmaceutical 
administration. Further, both approaches tend to produce a long- 
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lived effect and are poorly tltratable (i.e., they tend t 
produce an ••all-or-n ne" modulation of gene activity) . Perhaps 
m r importantly, both approaches have met with limited success 
for producing desired modulation of specific gene activity in 
5 vivo . 

Some contemporary small molecule pharmaceuticals (i.e., 
molecular weight less than about 1,000-3,000 daltons) act to 
modulate gene e3cpression of particular genes or gene subsets, and 
usually have advantageous pharmacological properties compared to 

10 oligonucleotides. However, most such small molecules usually act 
in an indirect fashion that is not sequence-specif ic (e.g. , they 
act by inhibiting an "upstream** signal transduction pathway that 
does not consist of transcriptional complexes) • For example, 
glucocorticoids and estrogen/ progesterone contraceptives modulate 

15 steroid responsive gene expression by interacting with 
proteinaceous steroid hormone receptors, and do not themselves 
form direct sequence-specific interactions with predetermined 
target gene sequences • Frequently , such small molecule 
pharmaceuticals possess undesirable side effects or affect a 

20. broad spectrum of expressed gene sequences where only a specific 
effect on the expression of a paorbicular gene is desired. 

Thus, there exists a need in the art for sequence- 
selective DNA-binding agents that can be used to modulate 
expression of predetermined genes and which can be formulated for 

25 facile and efficacious delivery into cells and organisms. The 
present invention fulfills these and other needs. 

The references discussed herein are provided solely for 
their disclosure prior to the filing date of the present 
application. Nothing herein is to be construed as an admission 

30 that the inventors are not entitled to antedate such disclosure 
by virtue of prior invention. All publications and patent 
applications herein are incorporated by reference to the same 
extent as if each individual publication or patent application 
was specifically and individually indicated to be incorporated 

35 by reference. 
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SUMMARY OF THE INVENTTOW 

The present invent! n provides several novel methods 
and compositions for modulating the transcriptional activity of 
genes that are under the transcriptional regulation of 
5 predetermined regulatory sequences to which transcription 
factors, accessory proteins, or other DNA-binding proteins bind. 
The present invention provides novel glycocon jugate DNA ligands 
which bind to a predetermined cis-acting DNA sequence and inhibit 
or disrupt binding of a DNA-binding protein to said predetermined 

10 cis-acting DNA sequence, thereby producing a modulation in the 
transcription rate of a structural gene(s) under the 
transcriptional control of the predetermined cis-acting DNA 
sequence* Typically, the predetermined cis-acting DNA sequence 
is a binding site for a transcription factor, such as a 

15 eukaryotic transcription factor, often an inducible eukaryotic 
transcription factor (e.g., nuclear activator of activated T 
cells, or NFAT), or other site at which a polypeptide having 
transcriptional regulatory activity binds. By interacting with 
the cis-acting DNA sequence, the glycoconjugate DNA ligand 

20 displaces, disrupts, or inhibits the binding of a DNA-binding 
transcriptional regulatory protein to the cis-acting sequence and 
interferes with (i.e., inhibits) the activity of the DNA-binding 
transcriptional regulatory protein, whether the DNA-binding 
transcriptional regulatory protein is a positive or negative 

25 regulator of transcription at the locus. Advantageously, these 
novel glycoconjugate DNA ligands reversibly bind to the DNA 
recognition sequences in a sequence-specific or sequence- 
selective manner and do not form reactive species (e.g., 
radicals) that damage or cleave the DNA. Such glycoconjugate DNA 

30 ligands can be used for various applications, including but not 
limited to their use as pharmaceuticals for effecting therapeutic 
modulation of the transcription of specific genes. 

The glycoconjugate DNA ligands of the present invention 
bind to DNA, preferably double-stranded DNA in the B helical 

35 form, in a sequence-selective manner such that the glycoconjugate 
DNA ligand preferentially binds to a sequence motif in 
chromosomal DNA of about at least four nucleotides in length, 
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thus f rmlng a llgemded DNA recognition site c nplex. In one 
embodlaenti the glyc conjugate DNA-ligand binds to a 
predetermined DNA recognition site at which a DNA-binding protein 
is presently bound (i.e., an occupied recognition site) and 
5 displaces the bound DNA-binding protein from the recognition 
site^ thus forming a llganded DNA recognition site complex which 
does not contain the DNA-binding protein. In another embodiment, 
the glycocon jugate DNA ligand binds to a predetermined 
recognition site to which a DNA-binding protein is presently 
10 bound without dissociating the DNA*-blnding protein from the site, 
thus forming a llganded DNA recognition site complex which 
contains the DNA-binding protein, but wherein the biological 
activity of the DNA-binding protein at the predetermined 
recognition site is inhibited or altered. In a further 
15 embodiment, a glycocon jugate DNA ligand preferentially binds to 
an unoccupied predetermined recognition site and inhibits binding 
of the DNA-binding protein to its recognition site sequence. 

The present invention also provides compositions 
containing glycocon jugate DNA llgands that can serve as reagents 
20 to displace a bound DNA-binding protein from its recognition site 
and/or inhibit binding of a DNA-binding protein to its 
recognition site sequence. These reagents comprise an effective 
concentration of a glycocon jugate DNA ligand and will be used in 
a variety of commercial and research applications, such as 
25 elutlng DNA-binding proteins from a predetermined DNA recognition 
sequence, such as in elutlng specifically bound NPAT protein (s) 
from a polynucleotide or oligonucleotide affinity column 
containing a NFAT polynucleotide recognition sequence. In one 
variation, these reagents are used for preparative scale 
30 commercial purification of NFAT from cell or nuclear extracts; 
such purified NPAT will itself find use as a commercial research 
or diagnostic reagent. In some embodiments, compositions 
comprising a therapeutically or prophylactically effective dosage 
of a glycocon jugate DNA ligand are formulated for administration 
35 to a human or veterinary patient for treating a disease condition 
by modulating the transcription of predetermined genes for 
therapeutic benefit. In one variation, a glycocon jugate DNA 
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ligand, such as an aUcyl glycoside derivative of calicheamicin 
(e.g., CLM-MG) is administered to a patient to inhibit 
binding of the NFAT transcripti n factor to its recogniti n 
sequence (s) and, for example, to inhibit T ceil activation and 
5 effect immunosuppression to prevent tissue graft rejection 
reactions. 

The invention also provides methods for identifying 
novel sequence-selective glycocon jugate DNA ligands that bind 
preferentially to a predetermined DNA recognition sequence to 
10 which a transcription factor can bind. These methods generally 
comprise the steps of (1) synthesizing a glycocon jugate DNA 
ligand having a carbohydrate recognition element comprising a 
mixed polysaccharide/ aryl substituent having the backbone 
structure of calicheamicin 71^, with one or more substituent 
15 modifications or substitutions (e.g., replacing the iodo 
substituent on the thiobenzoate ring with a bromo, chloro, or 
methyl substituent) and lacking the reactive diyne-ene ring 
structure, optionally also in covalent linkage to one or more 
non-interfering substituents replacing the diyne-ene substituent 
20 position, (2) contacting the glycocon jugate DNA ligemd with a 
polynucleotide having a predeterained recognition sequence to 
which a predetermined sequence-specific DNA-binding protein 
(e.g., transcription factor) binds in the presence of said 
sequence-specific DNA-binding protein under binding conditions 
25 (e.g., an NFAT recognition site), (3) determining the ability of 
the glycoconjugate DNA ligand to inhibit the binding of the 
predetermined sequence-specific DNA-binding protein to the 
predeteinnined DNA recognition sequence, and (4) identifying 
glycoconjugate DNA ligands which inhibit said protein binding to 
30 the predetermined DNA secpience as being a sequence-selective 
glycoconjugate DNA ligand. Typically, sequence-selectivity is 
demonstrated by including in the binding reaction additional 
species of DNA binding proteins (i.e., different than said 
predetermined DNA binding protein), termed "alternative DNA 
35 binding proteins'*, and additional polynucleotides having 
different recognition DNA sequences (i.e., different than said 
predetermined DNA recognition sequence) , termed "alternative DNA 
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recognition sequences'* which bind to said alternative DNA binding 
pr teins, and determining that the glycocon jugate DNA ligand 
preferentially inhibits binding of the pr d termined DNA binding 
protein to its predetermined DNA recognition sequence as compared 
5 with the alternative species of DNA binding proteins and their 
cognate alternative DNA recognition sequences . Usually, 
alternative DNA binding proteins and alternative DNA recognition 
seqpiences are transcription factors and transcription factor 
recognition sequences, respectively, that are distinct from the 

10 predetermined DNA-binding protein and the predetermined DNA 
recognition sequence (s) . Thus, if the predetermined DNA-binding 
protein is NPAT and the predetermined DNA recognition sequence 
is an NFAT recognition site (e.g. , 5 • -AAGGAGGAAAAACTGTTTCAT-3 • ) , 
examples of alternative DNA-binding proteins are Spl amd API and 

15 their cognate DNA recognition sequences 5 ' <-GGGGCGG6GC*-3 * and 5 * - 
6T6ACTCA6C6C6-3 * , among others. 

In a variation of the above-described general method 
for identifying novel sequence-selective glycocon jugate DNA 
ligands that bind preferentially to a specific DNA sequence, said 

20 ligands are identified by screening a library of glycogen jugate 
DNA ligands with a predetemnined DNA sequence. Alternatively, 
DNA sequences that bind to de novo synthesized glycocon jugate 
compounds that lack a predetermined DNA recognition site can be 
identified by screening a library of DNA sequences with a unique 

25 glycocon jugate ligand. 

Glycocon jugate DNA ligand-mediated inhibition of 
binding of a transcription factor to its recognition site (s) may 
be conveniently assayed using a transcription assay, such as an 
in vitro transcription assay or a transient (or stable) 

30 transfection assay employing a reporter gene operably linked to 
a transcriptional regulatory sequence comprising the 
predetermined transcription factor recognition site(s) . A 
glycocon jugate DNA ligand is introduced into or administered to 
a transcription assay wherein sequence-specific transcriptional 

35 regulation of a reporter gene is effected by a predetermined cis- 
acting operably linked transcription factor recognition site. 
Application of a glycocon jugate DNA ligand that preferentially 
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binds to the predetermined transcription factor recognition site 
and inhibits the transcriptional modulation c nf erred by the 
transcription factor recognition site on the transcription of th 
cis-linked reporter gene will result in altered transcription 
5 (e«g*, increased or decreased as compared to a control assay 
reaction lacking the glycocon jugate DNA ligemd) of the reporter 
gene. The ability of a glycocon jugate DNA ligand to inhibit the 
transcriptional effect produced in the control assay by binding 
of the transcription factor to its recognition site(s) in the 

10 predetermined transcriptional regulatory sequence will indicate 
that the sequence-selective glycocon jugate DNA ligand and/or its 
related structural congeners can be used as a modifier of 
specific gene expression for genes or other expressible 
polynucleotides under a transcriptional influence of the 

15 predetermined transcriptional regulatory sequence. 

The activity of a glycocon jugate DNA ligand as a 
modifier of specific gene expression may be employed for 
therapeutic benefit to treat diseases wherein modification of the 
expression of specific genes or gene subsets can reduce or arrest 

20 the pathological condition (e.g., by immune modulation, reversal 
of neoplastic transformation, inhibition of metastatic 
phenotypes, for gene therapy, and the like) • For therapeutic or 
prophylactic treatment of a disease condition, a therapeutic or 
prophylactic dosage of a sequence-selective glycocon jugate DNA 

25 ligand of the invention is administered to a patient; such a 
therapeutic or prophylactic dosage will be sufficient, alone or 
in combination with another pharmaceutical agent, to alter the 
transcription rate and/or net transcription of specific genes 
under the transcriptional influence of the predetermined DNA 

30 recognition sequence to which the glycocon jugate DNA ligand 
preferentially binds. 

The invention also provides therapeutic and/or 
prophylactic compositions of sequence-selective glycoconjugate 
DNA ligands for use as pharmaceuticals or reagents; such 

35 compositions comprise an effective amount or concentration of one 
or more sequence-selective glycoconjugate DNA ligands. 

In one embodiment, a glycoconjugate DNA ligand having 
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a callcheamicln-related structure lacking an enediyne core, 
callcheamicln MG, is provided for inhibiting the transcription 
of at least one structural gene under the cis-acting 
transcriptional influence of a DNA recognition site sequence for 
5 the transcription factor NPAT. The specific inhibition of one 
or more NFAT-dependent genes may be adveoitageously accomplished 
by administration of the glycoconjugate DNA ligand to produce a 
modulation of immune function, such as an inhibition of T 
lymphocyte activation (e.g«, IL-2 expression) for reducing graft 
10 rejection reactions, among other uses. 

BRIEF DESCRIPTION OF THE FIGURES 

Fig* 1 shows the structures of the enediyne antibiotic 
calicheamicin 71^ and the glycoconjugate DNA ligauid 

15 calicheamicin-MG, which represents the methyl glycoside 
derivative of calicheamicin 71^. 

Fig • 2 shows the transcription factor recognition 
seqfuencQS used. NFAT site is the human IL-2 enhemcer sequence - 
286 to -267; API site is derived from the human metallothionein 

20 enhancer; NFjcB site is derived from the mtirine inqaunoglobulin 
kappa light chain enhancer; Octl site is the human IL-2 enhancer 
sequence -85 to -65; Octl/octamer-associated protein (OAP) site 
is the human IL-2 enhancer sequence -101 to -65; HNF-1 site is 
the human jS-flbrinogen enhancer sequence -103 to -76; Spl site 

25 is the consensus Spl binding site. Sequences in lower case 
represent potential calicheamicin binding site(s). 

Fig. 3 shows sequence-selective inhibition of DNA- 
protein complex formation by calicheamicin MG (CLH-MG) . Lane l, 
buffer control; lanes 2-5, CLM-HG added to a final concentration 

30 of 100 juM, 32 /iM, 10 /iM, and 3.2 juM, respectively. NFAT, API, 
and NFkB activities are from TAg Jurkat nuclear extracts; Octl, 
Octl /OAP, and Spl activities are from HeLa nuclear extracts; HNFl 
activity is from MDCK cell nuclear extract. Identical results 
were obtained for Octl, Octl/OAP, and Spl activities from TAg 

35 Jurkat cells, and for API and NFkB activities from HeLa cells. 
NS indicates DNA binding activity non-saturable by a 100-fold 
excess of cold probe oligonucleotide. 
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Fig. 4 sh ws quantitati n of gel shift data f r NPAT 
(8 lid circles, n«4), API (solid squares, n=3), and HNFl (open 
squares, n»3) . Results are presented as percent inhibition of 
the DNA binding activity measured in the absence of CLH-MG and 
5 represent mean +/- standard error of the mean. 

Fig. 5 shows representative gel shift data 
demonstrating that alteration t>f the CLM recognition element 
within the NFAT binding site alters sensitivity of NFAT binding 
to inhibition by CLM-MG. The NFAT DNA probe sequences used are 

10 shown (in part) to the left of each gel shift panel. The upper 
sequences represent the wild-type NFAT probe sequence. The 
middle and lower sequences contain sequence changes (indicated 
in bold) that alter the CLM-MG binding site. DNA-protein 
complexes were allowed to form at room temperature for 60 minutes 

15 and transferred to 4**C. After 15 minutes, control buffer (lane 
1), CLM-MG at approximately 100 fM, 32 iM, 10 mM (lanes 2-4, 
respectively), or 100-fold excess cold probe (lane 5) was added. 
After one minute, the samples were electrophoresed. 

Fig. 6 represents a quantitation of the gel shift data 

20 exemplified in Fig. 5. Open squares represent wild-type NFAT 
probe; closed squares represent an A insertion (ns4) ; closed 
circles represent a G to T substitution (n=4} . Points represent 
mean; eirror bars indicate standard error of the mean. The 
nucleotide sequences are those shown in Fig. 5. 

25 Fig. 7 shows sequence-specific inhibition of 

transcription in vivo by CLM-MG. (Top panel) TAg Jurkat cells 
were transf ected with the indicated secreted alkaline phosphatase 
reporter DNA. Approximately 24 hours later, the cells were 
incubated with 200 /iM CLM-MG or buffer control for 15 minutes. 

30 The cells were then stimulated with ionomycin and TPA, resulting 
in a final concentration of 100 CLM-MG. Secreted alkaline 
phosphatase was measured after 12-16 hours. Results eure 
expressed as percentage of reporter gene activity stimulated by 
ionomycin plus TPA in absience of CLM-MG and represent the mean 

35 +/- standard error of the mean. (Bottom panel) The percentage 
of inhibition of reporter gene expression dependent on various 
transcription factors was determined as above except the 
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cone ntration of CLM-M6 was veuried from 0 to 250 /aM to 
demonstrate concentration-dependence. NFAT-SX (open circle) , API- 
SX (closed circle) , NFkB-SX (open sc[uare) , or Octl-SX (closed 
square) . 

5 

PETINITIQNg 

Unless defined otherwise, all technical and scientific 
terms used herein have the same meaning as commonly understood 
by one of ordinary skill in the art to which this invention 

10 belongs. Although any methods and materials similar or 
equivalent to those described herein can be used in the practice 
or testing of the present invention, the preferred methods and 
materials are described. For purposes of the present invention, 
the following temns are defined below. 

15 The term "naturally-occurring" as used herein as 

applied to an object refers to the fact that an object can be 
found in nature. For example, a polypeptide or polynucleotide 
sequence that is present in an organism (including viruses) that 
can be isolated from a source in nature and which has not been 

20 intentionally modified by man in the laboratory is natiurally- 
occurring. As used herein, laboratory strains of rodents which 
may have been selectively bred according to classical genetics 
are considered naturally-occurring animals. 

, The term "glycoconjugate DNA ligand" as used herein 

25 refers to DNA-binding compoimds comprising a structural backbone 
of calicheamicin yl^ or esperamicin A^ and like compounds and 
lacking a reactive enediyne core substituent. Preferably, a 
glycocon jugate DNA ligand comprises a species of Stxructiure I and 
binds DNA in a sequence-selective manner; calicheamicin MG is a 

30 preferred species of glycocon jugate DNA ligand for some 
embodiments (e.g., NFAT recognition site binding). 

The term "sequence-selective binding" as used herein 
refers to the property of a compound, such as a glycocon jugate 
DNA ligand, to preferentially bind to one or more DNA sequences 

35 of more than two and less than 100 nucleotides in length as 
compared to the compound's binding affinity (e.g., Kd) to a pool 
of random sequence DNA r to bulk human, calf, or salmon genomic 
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DNA. For example, a compound such as calicheamicln 7!^ may 
preferentially bind to the tetranucle tide sequence TCCT, TCCC, 
TCCA, ACCT, TCCG, GCCT, CTCT, TCTC, or TTTT as compared t the 
remaining tetranucleotide sequence combinations, thus exhibiting 
5 sequence-selective binding* Within a group of sequences that are 
selectively bound, a subset of those sites may be more 
preferentially bound (i.e., have a higher binding affinity). 
Calicheamicln 71^ preferentially binds to 
homopyrimidine/homoptirine sites, such as 5**TCCT/AGGA and 
10 TCTC/GAGA. 

The term ''transcriptional enhancement** is used herein 
to refer to functional property of producing an increase in the 
rate of transcription of linked sequences that contain a 
functional promoter. 

15 The term **agent** is used herein to denote a chemical 

compound, a mixtxire of chemical compounds, a biological 
macromolecule, or an extract made from biological materials such 
, as bacteria, plants, fungi, or animal (particulaurly mammalian) 
cells or tissues. Agents are evaluated for potential activity 

20 as transcriptional modifiers (e.g., immunosuppressants, 
antineoplastics, etc.) by inclusion in screening assays described 
hereinbelov. Agents may be selected for their ability to 
displace a sequence-selective glycocon jugate DNA ligand from . a 
predete3nnined polynucleotide sequence; such agents may be 

25 administered in conjunction with a sequence-selective 
glycocon jugate DNA ligand for various uses, such as an antidote 
to reverse the effects of a previously administered 
glycoconjugate DNA ligemd, or as a sharpening agent to enhance 
the binding preference of the glycoconjugate DNA ligand for a 

30 particular predetermined DNA sequence as compared to other 
similar sequences. 

The terms "immunosuppressant" and "immunosuppressant 
agent" are used herein interchangeably to refer to agents that 
have the functional property of inhibiting an immune response in 

35 hiunan, particularly an immune response that is mediated by 
activated T-cells. 

As used herein, a "transcription factor recognition 
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site" and a "transcription factor binding site" refer to a 
polynucleotide s qu nc (s) or sequence aotif (s) which are 
identified as being sit s for the sequence-specific interaction 
of one or more transcription factors, frequently taking the form 
5 of direct protein-DNA binding. Typically, transcription factor 
binding sites can be identified by DNA footprinting, gel mobility 
shift assays, and the like, and/ or can be predicted on the basis 
of known consensus sequence motifs, or by other methods known to 
those of skill in the art. For example and not to limit the 

10 invention, eukaryotic transcription factors include, but are not 
limited to: NFAT, API, AP-2, Spl, OCT-1, OCT-2, OAF, NPjcB, CREB, 
CTF, TFIIA, TFIIB, TFIID, Pit-1, C/EBP, SRF (Mitchell PJ and 
Tijan R (1989) Science 371). For purposes of the invention, 

steroid receptors, RNA polymerases, and other proteins that 

15 interact with DNA in a secpience*-specif ic manner and exert 
transcriptional regulatory effects are considered transcription 
factors. 

The term "NFAT-dependent gene** is used herein to refer 
to genes which: (1) have a NFAT recognition site within 10 

20 kilobases of the first coding exon of said gene, and (2) manifest 
an altered rate of transcription, either increased or decreased, 
from a major or minor transcriptional start site for said gene, 
wherein such alteration in transcriptional rate correlates with 
the presence of NFAT. 

25 The terms "transcriptional modulation" and 

"transcriptional influence" sure used herein to refer to the 
capacity to either enhance transcription or inhibit transcription 
of a structural sequence linked in cis; such enhancement or 
inhibition may be contingent on the occurrence of a specific 

30 event, such as stimulation with an inducer smd/or may only be 
manifest in certain cell types. For example but not for 
limitation, expression of a protein that prevents formation of 
an activated (e.g., ligand-bound) glucocorticoid receptor will 
alter the ability of a glucocorticoid-responsive cell type to 

35 modulate transcription of an glucocorticoid-responsive gene in 
the presence of glucocorticoid. This alteration will be manifest 
as an inhibition of the transcriptional enhancement of the 
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glucocorticoid-responslve gene that normally ensues following 
stimulation with glucocorticoids. The altered ability to 
modulat transcripti nal enhancement or inhibition may affect the 
inducible transcription of a gene, such as in the just-cited 
5 example, or may effect the basal level transcription of a gene, 
or both. For example, a reporter polynucleotide may comprise a 
glucocorticoid-inducible enhancer-promoter directing 
transcription of a sequence encoding a reporter protein (e.g., 
lucif erase, i9-galactosidase, chloramphenicol acetyltrzmsf erase) . 

10 Such a reporter polynucleotide may be transferred to a 
glucocorticoid-responsive cell line for use as a reporter host 
cell to screen a panel of glycocon jugate DNA ligands for the 
2ibility to affect a glucocorticoid-induced transcriptional 
modulation by sequence-selective interaction with one or more 

15 glucocorticoid DNA recognition site sequences. Glycocon jugate DNA 
ligands that enhance transcription of the cis-linked reporter 
gene in the absence of glucocorticoids may be identified as 
putative positive regulators of glucocorticoid-response elements, 
whereas glycocon jugate DNA ligands that silence expression of the 

20 reporter in cells cultured in the presence of glucocorticoids may 
be identified (e.g., by selecting cells not expressing the cell 
surface reporter) as glycocon jugate DNA ligands that interfere 
with the expression of glucocorticoid-responsive genes. Numerous 
other specific examples of transcription reignilatory elements, 

25 such as specific enhancers and silencers, are known to those of 
skill in the art and may be selected for use in the methods and 
polynucleotide constructs of the invention on the basis of the 
practitioner's desired application. Literature sources and 
published patent documents, as well as GenBank^ and other 

30 sequence information data sources can be consulted by those of 
skill in the art in selecting suitable transcription regulatory 
elements for use in the invention. Where necessary, a 
transcription regulatory element may be constructed by synthesis 
(and ligation, if necessary) of oligonucleotides made on the 

35 basis of availEdile sequence information (e.g., GenBank sequences 
for a CD4 enhancer or a SV40 early promoter) . For example but 
not for limitation, a glycocon jugate DNA ligand that prevents 
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transcriptional activation f NFAT-dependent genes in the 
presence of functional NFAT generally will alter the ability of 
a T cell to modulate transcription of an IL*2 gene in response 
to an antigen stimulus. This alteration will be manifest as an 
5 inhibition of the transcriptional enhancement of the IIr-2 gene 
that normally ensues following T cell stimulation. The altered 
ability to modulate transcriptional enhancement or inhibition 
may affect the inducible transcription of a gene, such as in the 
just-cited IL-2 example, or may effect the basal level 

10 transcription of a gene, or both. Agents that disrupt, for 
example, binding of silencer proteins to silencer transcription 
regulatory elements typically produce an increase in basal and/or 
induced transcription rate of a cis^linked gene. 

As used herein, the term "operably linked** refers to 

15 a linkage of polynucleotide elements in a functional 
relationship. A nucleic acid is **operably linked** when it is 
placed into a functional relationship with another nucleic acid 
sequence. For instance, a promoter or enhancer is operably 
linked to a coding sequence If it affects the transcription of 

20 the coding sequence. Operably linked mesms that the DNA 
sequences being linked are typically contiguous and, where 
necessary to join two protein coding regions, contiguous and in 
reading frame. However, since enhancers generally function when 
separated from the promoter by several kilobases and intronic 

25 sequences may be of variaU3le lengths, some polynucleotide 
elements may be operably linked but not contiguous. 

As used herein, the terms ** expression cassette** 
and ''reporter polynucleotide** refer to a polynucleotide 
comprising a promoter sequence and , optionally , an enhancer 

30 and/or silencer element (s) , operably linked to a structural 
sequence, such as a cDNA sequence or genomic DNA sequence 
encoding a reporter protein (e.g., luciferase, )9-galactosidase, 
chloramphenicol acetyltransf erase) , such that the reporter gene 
sequence is under the transcriptional influence of a cis-acting 

35 transcription factor binding site and/or recognition site. In 
some embodiments, an expression cassette may also include 
polyadenylation site sequences to ensure polyadenylation of 
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transcripts. When an expression cassett is transferred Into a 
sultabl host cell, the structural sequence Is transcribed from 
the oxpr sslon cassette promot r, and a translatable message Is 
generated, either directly or following appropriate RNA splicing, 
5 As used herein, the term "reporter host cell" refers 

to a eukaryotic cell, preferably a mammalian cell, which harbors 
a reporter polynucleotide. Preferably, the reporter 

polynucleotide is stably integrated into a host cell chromosomal 
location, either by nonhomologous Integration or by homologous 
10 sequence targeting, although transient transfectlon methods may 
be employed. 

As used herein, the term "endogenous * DNA sequence" 
refers to naturally-occurring polynucleotide sequences contained 
in a eukaryotic cell. Such sequences Include, for example, 

15 chromosomal sequences (e.g., structural genes, promoters, 
enhancers, recomblnatorial hotspots, repeat sequences, integrated 
provlral sequences) • A "predetermined sequence" is a sequence 
which may be selected at the discretion of the practitioner on 
the basis of known or predicted sequence inforxnation. An 

20 exogenous polynucleotide is a polynucleotide which is transferred 
into a eukaryotic cell. 

The term "alkyl" refers to a cyclic, branched, or 
straight chain alkyl group containing only carbon and hydrogen, 
and unless otherwise mentioned, contain one to twelve carbon 

25 atoms. This term is further exemplified by groups such as 
methyl, ethyl, n-propyl, Isobutyl, t-butyl, pentyl, pivalyl, 
heptyl, adamantyl, and cyclopentyl. Alkyl groups can either be 
unsubstltuted or substituted with one or more substltuents, e.g. , 
halogen, alkyl, alkoxy, alkylthlo, trlf luoromethyl , acyloxy, 

30 hydroxy, mercapto, carboxy, aryloxy, aryl, arylalkyl, heteroaryl, 
amino, alicylamino, dialkylcunino, morpholino, piperldlno, 
pyrrolldln-l-yl, piperazin-l-yl, or other functionality. 

The term "lower alkyl" refers to a cyclic, branched or 
straight chain monovalent alkyl radical of one to six carbon 

35 atoms. This term is further exemplified by such radicals as 
methyl, ethyl, n-propyl, i-propyl, n-butyl, t-butyl, i-butyl (or 
2-methylpropyl) , cyclopropy Imethyl , i-amyl, n-amyl, and hexyl. 
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The term "euryl" or "Ar" refers to a monovalent 
iinsaturated aromatic carbocycllc group having a single ring 
(e.g«, phenyl) or multiple condensed rings (e.g., naphthyl or 
anthryl) , which can optionally be unsubstituted or substituted 
5 with, e.g., halogen, alkyl, alkoxy, alkylthio, trifluoromethyl, 
acyloxy, hydroxy, mercapto, carboxy, aryloxy, aryl, arylalkyl, 
heteroaryl, eunino, alkylamino, dialkylamino, morpholino, 
piperidino, pyrrolidin-l-yl, piperazin-l-yl, or other 
functionality. 

10 The term "substituted alkoxy** refers to a group having 

the structure -0-R, where R is alkyl which is substituted with 
a non-interfering substituent. The term "arylalkoaey** refers to 
a group having the structure -0*R-Ar, where R is alkyl and Ar is 
an aromatic substituent. Arylalkoxys are a siibset of substituted 

15 alkoxys. Examples of preferred substituted alkoxy groups are: 
benzyloxy, napthyloxy, and chlorobenzyloxy. 

The term "aryloxy" refers to a group having the 
structure -O-Ar, where Ar is an aromatic group. A preferred 
aryloxy group is phenoxy. 

20 The term "non-interfering substituent" as used herein 

refers to a chemical substituent which does not comprise an 
enediyne structure or core aglycone (i.e., calicheamicinone 
substituent) and which does not, when present in a compound, 
result in the compound exhibiting a substantially reduced (e.g., 

25 by more than two orders of magnitude) affinity for binding DNA 
in a sequence-selective meumer. Frequently, non- interfering 
substituents can comprise moieties which enhance pharmacological 
or pheannacokinetic properties (e.g., hydrophobic moieties such 
as aryl or naphthalene groups, and the like). 

30 

D^TATTlfgD p^SCPIPTIOW 

A basis of the invention is the unexpected finding that 
structural variants of naturally occ\arring eyediyne anticancer 
antibiotics can interact with DNA in a sequence-selective manner, 
35 and this sequence-selective interaction can result in the 
displacement or inhibition of binding of certain DNA-binding 
proteins (e.g., transcription factors) to their recognition site 
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sequences. Glycocon jugate DNA llgands that comprise the 
structural backbone (i.e., the saccharide/ aryl backbone) of 
calicheamicln 71^ and other natiurally occurring n dlyne 
antibiotics (e.g., esperlnicln A^^) , and which lack the reactive 
5 enedlyne iaolety of the naturally occurring antibiotic, can 
possess seguence*selectlve DNA binding activity and lack DNA 
cleavage activity. These non-reactive glycocon jugate DNA llgands 
can possess sequence-selective, reversible noncovalent binding 
to DNA, and may be used to alter the interactions of certain DNA- 

10 binding proteins (e.g., transcription factors) to their DNA 
recognition sequences on a sequence-selective basis. The 
t2urgeted alteration of proteln-DNA binding interactions by the 
glycocon jugate DNA llgands of the Invention serve as the basis 
for selective elutlon of transcription factors from predetexialned 

15 DNA binding sites, for transcriptional modulation of genes under 
the transcriptional Influence of a DNA sec[uence to which the 
glycocon jugate DNA ligand preferentially binds, and for other 
embodiments. 

A preferred embodiment' is the callcheamlcin-based 

20 glycoconjugate DNA ligand, calicheamicln H6 shown in Fig. 1, 
which binds in a sequence-selective manner to the tetranucleotlde 
sequences TCCT, TCCC, TCCA, ACCT, TCCG, GCCT, CTCT, TCTC, or 
TTTT, especially to 5«-TCCT-3* (and its complement 5«-AGGA-3») 
and 5 • -TCTC-3 • ( and its complement 5 • -GAGA-3 • ) , and which 

25 preferentially binds to NFAT recognition site sequences 
interfering with the interaction of NFAT to its recognition 
slte(s), thereby modulating NFAT-dependent transcription. 

DNA Binding Proteins 
DNA-blndlng proteins perform fundamental functions in 

30 cell biology and In organlsmal physiology. DNA-blnding proteins 
are Involved In a variety of activities. Including replicating 
the genome, transcribing active genes, repressing inactive genes, 
repairing damaged DNA, and as basic components of chromatin 
structure. In particular, transcription factors regulate cell 

35 development, differentiation, and cell growth by binding to a 
specific DNA site (or set of sites) and regulating gene 
transcription. Critical to the function of transcription factors 
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Is their ability to selectiv ly bind to specific DNA sequence 
motifs located in or near structxiral gene sequ nces and 
regulatory elea nts (e*g., enhanc rs, silencers, promoters). 

Many DNA-binding proteins can be grouped into classes 
5 that use related structural motifs for recognition. Large, well- 
established families include, for example, the helix-turn-helix 
(HTH) proteins, the homeodomains, zinc finger proteins, steroid 
receptor DNA-binding domains, leucine zipper proteins, and the 
helix-loop-helix proteins, and jS-sheet DNA-binding proteins (Pabo 

10 CO and Sauer RT (1992) Ann. Rev. Biochem. fil: 1053) . There are 
many important DNA-binding proteins that are not categorized into 
the known structural families; the SV40 large T antigen and the 
human p53 tiimor suppressor gene are two such examples. 

Transcription factors in particular tend to possess 

15 sequence-specific binding properties which are associated with 
their activities as transcriptional regulators of adjacent 
structural genes. The precise physical basis of sequence- 
specific binding is not fully understood with regeurd to all known 
transcription factors, and the nature of sequence-selectivity may 

20 be vexry complex and require considerations of local variations 
in DNA structure and/or flexibility. However, most of the well- 
chauracterized families of DNA-binding proteins use a-helices to 
make base contacts in the major groove of B-form DNA, although 
i3-sheets or regions of extended polypeptide chain can also be 

25 used to make base contacts. 

Hydrogen bonding between peptide side chains and 
nucleic acid bases are frequently involved in sequence-specific 
recognition and binding, although contacts with the DNA backbone, 
especially hydrogen bonds to phosphodiester oxygens and 

30 hydrophobic interactions with the sugar rings in the DNA 
baclcbone, are often present as well. Structural studies of 
contacts with bases have revealed (1) direct hydrogen bonds 
between the protein side chains euid the bases, (2) hydrogen bonds 
between the polypeptide backbone and the bases, (3) hydrogen 

35 bonds mediated by water molecules, and (4) hydrophobic contacts 
(Pabo CO and Sauer RT (1992) OD.cit. ^ . Most base contacts appear 
to occur through the major gr ove of B-form DNA, although there 
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could be sequence^dependen^ Influences on the local structure of 
a given binding site that deviates from canonical B-form DNA. 
Furthermore, there does not appear t be a simple "recognition 
code" wherein a one-to-one correspondence between the amino acid 
5 side chains and the bases or DNA back)3one moieties they contact, 
further implicating the importance of local structure as a basis 
for sequence-specific recognition. 

^AT-'Dependent Transcriptional Activitv 

10 One class of tramscription factor binding sites are the 

DNA recognition sequences for the NFAT transcription factor, 
which is involved in T cell activation and the like. 

The immune response is coordinated by the actions of 
cytokines produced from activated T lymphocytes. The precursors 

15 for most T lymphocytes arise in the bone marrow^ and migrate to 
the thymus where they differentiate and express receptors capable 
of interacting with antigen. These differentiated T lyiophocytes 
then migrate to the peripheral lymphoid organs where they remain 
quiescent until they come in contact with the cognate antigen. 

20 The interaction of antigen with the antigen receptor on T 
lymphocytes initiates an ordered series of pleiotropic changes; 
a process denoted as T lymphocyte activation. T lymphocyte 
activation is a 7 to 10 day process that results in cell division 
and the accjuisition of immunological functions such as 

25 cytotoxicity and the production of lymphokines that induce 
antibody production by B lymphocytes and control the growth and 
differentiation of granulocyte and macrophage precursors. The 
cytokines produced by activated T lymphocytes act upon other 
cells of the immune system to coordinate their behavior and bring 

30 about an effective immune response. 

The initiation of T lymphocyte activation requires a 
complex interaction of the antigen receptor with the combination 
of antigen and self-histocompatibility molecules on the surface 
of antigen-presenting cells. 

35 T lymphocyte activation involves the specific 

regulation of particular subsets of genes. The transcriptional 
regulation characteristic of T cell activati n begins minutes 
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after th antigen encoxinter and continues until at least 10 days 
later. The T lymphocyte activation genes can be grouped 
according to the time after stimulation at which each gene is 
transcribed. Early genes are the first subset of T lymphocyte 
5 activation genes that is expressed during the activation process. 
Expression of the early genes triggers the .transcriptional 
modulation of subsequent genes in the activation pathway. 
Because of the critical role of the T lymphocyte in the immime 
response, agents that interfere with expression of the early 

10 activation genes, such as cyclosporin A and FK506, are effective 
immunosuppressants. 

The induction of cytokine production in T lymphocytes 
as a consequence of specific contact with antigens serves to 
coordinate the immune response. Cytokines^ are responsible for 

15 the control of proliferation and differentiation among precursors 
of B cells, granulocytes, and macrophages. One such cytokine 
that is produced by activated T cells is inter le\ikin-2 (IL-2) . 

The IL-2 gene is essential for both the proliferation 
and Immunologic function of T cells. Moreover, the IL-2 gene is 

20 a representative and important early gene that is transcribed at 
an enhemced rate during T cell activation. Putative signalling 
pathways connect the transcriptional regulation of the IL-2 gene 
with the antigen receptor on the T cell surface. 

Transcription of the early genes requires the presence 

25 of specific transcription factors, such as MFAT, which in turn 
are regulated through interactions with the antigen receptor. 
These transcription factors are proteins which act through 
enhancer and promoter elements near the early activation genes 
to modulate the rate of transcription of these genes. Many of 

30 these transcription factors reversibly bind to specific DNA 
sequences located in and near enhancer elements. 

The inter leukin-2 (IL-2) gene is a paradigmatic early 
activation gene. The IL-2 gene product plays a critical role in 
T lymphocyte proliferation and differentiation. The IL-2 gene 

35 is transcriptionally active only in T cells that have been 
stimulated through the antigen receptor or its associated 
mol cules (Cantrell and Smith (1984) Science 224 ; 1312) . The 
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transcrlptl nal induction of il-2 in activated T lymphocytes is 
mediated by a typical early gene transcrlptl nal enhancer that 
extends from 325 basepairs upstr am of the transcriptional start 
site for the IL-2 gene (Diirand et al. (1988) Mol, and cell« Biol . 
5 £: 1715) . Other genes known to contain NFAT recognition sites 
in their regulatory regions include: 7-interferon/ IL-4, GM-CSF, 
and others. This region, which is referred to herein as the lL-2 
enhancer, has been used extensively to dissect the requirements 
for T lymphocyte activation. An array of transcription factors, 

10 including NFAT, NFkb, API, Oct-1, md a newly identified protein 
that associates with Oct-1 called OAP-40, bind to sequences in 
this region (Ullman et al. (1991) Science 254 ; 558-562) . These 
different transcription factors act together to integrate the 
complex requirements for T lymphocyte activation. One of the 

15 functional seqfuences in the IL-2 enhancer is a binding site for 
a protein complex, designated NFAT (nuclear factor of activated 
X lymphocytes) , that functions as a transcriptional regulator of 
IL-2, IL-4, and other early activation genes (Shaw et al. (1988) 
Science 241 ; 202-205) • The NFAT complex is formed when a signal 

20 from the antigen receptor is transduced to the nucleus and an 
activated NFAT complex forms. Enhancement of transcription of 
genes adjacent to the NFAT recognition site requires that the 
NFAT complex bind to the recognition site (Shaw et al. (1988) 
Science 241 ; 202-205) . 

25 Among the group of tratnscription factors mentioned 

above, the presence of NFAT is chetracteristic of the 
transcription events involving early activation genes, in that 
its recognition sequence is able to enhance transcription of 
linked heterologous genes in activated T cells of transgenic 

30 animals (Verweij et al. (1990) J, Biol, Chem. 265: 15788). The 
NFAT sec[uence element is also the only knoim transcriptional 
element in the IL-r2 enhancer that has no stimulatory effect on 
transcription in the absence of physiologic activation of the T 
lymphocyte through the antigen receptor or through treatment of 

35 T cells with the combination of ionomycin and PMA. For example, 
the NFAT element enhances transcription of linked sequences in 
T lymphocytes which have had proper presentation of specific 
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antigen by MHC-matched antigen presenting cells or have been 
stimulated with the combinati n of ioncmycin/FHA, but not in 
unstimulated T lymph cytes (Durand et al,, Mol. and cell. Biol , 
1:1715 (1988); Shav et al., Science 24l !202-2Q5 fiQflfl) ; Karttunen 
5 and Shastri, Proc. Na tl. Acad, Sci, USA ia:3972-3976 (1991), 
1991; Verweij et al*, J, Biol, chem. 215:1578a-15795 (1990)). 
Moreover, the NPAT sequence element naturally enhances 
transcription of the IL-2 gene only in activated T lymphocytes. 

Transcriptional enhancement involving NFAT recognition 

10 sequences is completely blocked in T cells treated with ^ 
efficacious concentrations of cyclosporin A or FK506, with little 
or no specific effect on transcriptional enhancement involving 
recognition sites for other transcription factors, such as API 
and NF-icB (Shaw et al. (1988) Science 211: 202; Emmel et al. 

15 (1989) Science 246: 1617; Mattila et al. (1990) EMBO J 9: 4425). 

The presence of DNA' recognition sequence (s) for the 
NFAT complex appear to restrict transcription \ of linked DNA 
sequences, such as the lli-2 gene and other eeurly activation 
genes, so that transcription of these linked sequences is 

20 enhanced only in stimulated T lymphocytes (Durand et al. (1988) 
Mol. and Cell. Biol, £: 1715; Shaw et al. (1988) Science 241 : 
202; and Verwiej et al. (1990) J. Biol. Chem, 265 : 15788). 
Within the lL-2 enhancer, there are two NFAT recognition sites, 
a proximal and distal NFAT site. Elimination of an NFAT site 

25 from the IL-2 enhancer drastically reduces the ability of the IL- 
2 enhemcer element to function. In addition, tandem multimeric 
arrays of binding sites for the NFAT protein will direct 
transcription of linked sequences in activated T lymphocytes or 
In vityo transcription cocktails containing NFAT heterodimer, but 

30 not in other cell types or in transcription cocktails lacking 
NFAT. 

A distinguishing feature of the NFAT DNA binding site 
is its purine-rich binding site, e.g., 5'-AAGAGGAAAAA-3' . DNA 
sequence comparisons of the promoter/ enhancer regions of several 
35 genes that respond to T-cell activation signals has identified 
putative NFAT protein binding sites, such a comparison suggests 
that NFAT or a related family member may bind within the 
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promoter /enhancer regions of ther T-cell activation dependent 
genes. Most f these genes are sensitive to immunosuppressants, 
such as FK506 and cyclosporin* A list of putative NFAT binding 
sites follows in Table I: 



Purine Rich Core Sequences 

6AAA6GAGGAAAAACTGTTT 

CCAAAGAGGAAAATTT6TTT 

CAGAAGAGGAAAAATGAAGG 

TCCAGGAGAAAAAATGCCTC 

AAAACTTGTGAAAATACGTA 

TAAAGGAGAGAACACCAGCT 

GCAG6GTGGGAAAGGCCTTT 



TABLE I 
Position 
(-289 to -270) 
(-293 to -274) 
(-143 to -124) 
(-143 to -124) 
(-71 to -52) 
(-270 to -251) 
(-241 to -222) 



Gene 

himan IL-2 
murine IL-2 
human IL-2 
human IL-4 
human g-IFN 
HIV-LTR 
murine GM-CSF 



(AbbreviationB: 11.-2, interloukin 2; IL-4, intarleukin 4j HIV- 
LTR, human iinmunode£ici€mcy virus long terminal repeat; GM-CSP, 
granulocytQ-macrophage colony stimulating factor.) 
The intragenic enhancer of <M-CSF also comprises an NFAT site 
(Cocherill et al. (1992) Proc. WatX- Acad, Sci> iV.S.h.) . 

Inhibition of NFAT-dependent transcription can inhibit 
the process of T cell activation, leading to a reduction in T 
cell-mediated immime responses such as graft rejection. 
Unfortunately, cyclosporin A, PK506, and other currently used 
immunosuppressants possess several disadvantageous properties and 
produce undesirable side effects. It is believed that direct 
inhibition of NFAT-dependent gene transcription would provide 
superior immunosuppressant action and minimize undesirable side- 
effects. Unfortunately, there is at present no convenient method 
for selectively inhibiting NFAT-dependent gene transcription that 
is both efficacious and has minimal side effects. 



Glycoconiuaate DNA Liaands 
One class of small molecule drugs that has been used 
for human antineoplastic therapy are the naturally-occurring 
enediyne anticancer antibiotics which bind to and cleave DNA 
(Nicolaou KC and Dal WM (1991) Anaew . Chem , Int . Kd . Engl . Ifi: 
1387) . The enediyne family of anticancer antibiotics comprises 
the neocarzinostatin chromophore (Edo et al. (1985) Tetrahedron 
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Lett> ifi: 331)^ the calicheamiciris (Lee t al. (1987) J, Am. 
' ChgMf S9C,«, IflS: 3464), the esperaoicins (G liJc et al. (1987) 3^ 
Affit Chen, Sqg> jjia: 346I) , kedarcidin ( Proc. Natl, Acad, sel. 
(XJ.S.A.\ (1993) 2822), and the dynemlclns (Konishi et al. 

5 (1990) J. Am. Chem, sqc. 112: 3715) . The DNA binding properties 
of these enediyne antibiotics have been investigated (Zimmer C 
and Wahnert U (1986) Proa. Bio ohys. Molec, Biol. 31; Jarman 
M (1991) HatolCe aia: 566; Lu et al. (1991) J. Blomolec. struct. 
Pvn^ffit 2: 271; Sugiura et al. (1991) Biochemistry 10: 2989; 

10 Walker et al. (1992) Proc. Natl. Acad, scl. m.s.A.^ fig: 4608; 
Hawley et al. (1989) Proc. Natl. Acad. Sc i. (U.S.A.) fifi: 1105; 
Sugiura et al. (1990) Proc. Natl . Acad. Sci. /U.S.A. ) fil: 3831; 
Dedon PC and Goldberg IH (1992) Biochemistry 21: 1909; Nicolaou 
et al. (1992) Science 256: 1172; Drak et al. (1991) Proc. Natl. 

15 Acad, Sc i. ru.S.A.) M: 7464). Although some of these 
antibiotics (e«g., calicheamicln, esperimicln) bind DNA in a 
sequence-selective manner (Drak et al. (1991) op^cit. ) . they have 
not been shown to act as transcription modulators for specific 
genes, nor would they be expected to in view of their propensity 

20 to cleave DNA and lead to cytotoxicity. 

Various enediyne anticancer cmtibiotics such as 
calicheamicin 71^, dynemicin A, dynemicin H, calicheamicin T, 
esperimicin A^, esperimicin D, and the like can be obtained by 
standard fermentation and purification methods and/or synthesized 

25 or modified by standard organic synthetic procedures (Konishi et 
al. (1990) J. Am. Chem. Soc. 112 : 3715; Konishi et al. (1989) J\ 
Antibiot. 42 ; 1449; Sugiura et al. (1991) Biochemistry 30; 2989; 
Lee et al. (1987) J. Am. Chem. Soc. 109 : 3464; Lee et al. (1987) 
J. Am. Chem. Soc. 109 ; 3466; Golik et al. (1987) J. Am. Chem. 

30 Soc. 109 : 3462; Haseltine et al. (1991) J. Am. Chem. Soc. 113 ; 
3850; Nicolaou et al. J. Am. chem. Soc. 112 ; 8193; Lee et al. 
(1992) J. Am. Chem. Soc. 114 ; 985; Walker et al. (1992) Proc. 
Natl. Acad. Sci. (U.S.A.) 89: 4608; Halcomb et al. (1992) Ana. 
Chem.. Int. Ed. Enol. 31 ; 338; Halcomb et al. (1992) Ana. Chem. 

35 104; 314; Halcomb et al. (1991) J. Am. Chem. Soc. 113 ; 5080; 
Nicolaou et al. . (1992) Ana. Chem. , Int. Ed. Enal. 31; 855; 
Nicolaou t al. Ana . Chem , 104 ; 926; and Smith et al. (1992) it 
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Affigrt gh^ffii gQCt 111: 3134. 

The callcheamlclns and esperlmicins preferentially 
recognize specific sequences f nucle tides in doubl -stranded 
DNA, and cleave the chains at preferred sites. The structiural 
5 basis for the sequence selectivity of the calicheamicins and 
esperlmicins is not known, however interactions between the sugeu: 
portions of these molecules and specific sites on DNA may be 
involved (Zein et al. (1989) Science 244 ; 697; Walker et al. 
(1992) Proc. Natl. Acad. Scl. (\3.S.A.\ 4608; Drak et al. 

10 (1991) Proc. Natl. Acad. Scl. fU.S.A.^ 882 7464). Calicheamicih 
71^ binds to DNA in a sequencer-selective manner and then produces 
strand scission via rearrangement of its enediyne core to f03nn 
a reactive biradical. 

Enediyne compounds potently cleave DNA through 

15 bioreduction of the trisulfide and subsequent 1,4-benzenoid 
diradical formation, which results in hydrogen abstraction from 
adjacent nucleotide bases. The subsequent double strand DNA 
cleavage occurs at sites on each strand that are separated by 
three base pairs, suggesting a minor groove interaction. 

20 Modeling studies indicate that the stereochemical orientation of 
the aglycone relative to the carbohydrate recognition element 
permits a complementarity of fit on placement of calicheamicin 
into the minor groove and that specific interactions between the 
iodo substituent on the 2uromatic ring and adjacent N2 amino 

25 substituent s of the guanine dinucleotide may also contribute to 
the preferred sec[uence specificity. 

The structure of calicheamicin 71^ is shown below, with 
the calicheeunicin polysaccharide/ aryl backbone composed of sugars 
designated herein as rings A, B, D, and £ and a thiobenzoate 

30 moiety designated as ring C. The enediyne core moiety (a cyclic 
enediyne) is linked to the A sugar ring through an ether linkage. 
The eyediyne is capable of rearranging to generate an aromatic 
ring structure with two chemically active radical sites (a 1,4- 
benzenoid diradical) following bioreduction of the trisulfide 

35 residue of the enediyne core. The two unbonded electrons of the 
biradical ring can abstract hydrogen atoms from the sugar 
phosphate backbone of DNA, resulting in doxibl strand breakage. 
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CH3O £ 



In one aspect of the invention, glycocon jugate DMA 
ligands having the backbone structure of calicheamicin (i.e., 
5 comprising rings A, B, C, D, and/or E) and lacking the reactive 
enediyne core moiety are provided for modulating transcription 
of a gene by sequence-selective DNA binding to a DNA-binding 
protein recognition sequence. Such binding of the 

glycocon jugate DNA ligand to the recognition site(s) thereby 

10 alters (typically by inhibiting) binding of a DMA-binding protein 
to its recognition site and affects (usually negatively) 
transcription dependent upon the DMA-binding protein (e.g. , NFAT- 
dependent transcription if the glycocon jugate binds to a NFAT 
site)* In alternative embodiments, the glycocon jugate DMA 

15 ligands comprise a subset of the backbone structure of 
calicheamicin, such as comprising rings A, B, and C but lacking 
the D and E rings; or comprising rings E, A, and B but lacking 
the C and D rings, as well as other variations. Glycocon jugate 
DMA ligands having a variety of backbone structures other than 

20 calicheamicin and esperimicin are also within the scope of the 
present invention, although glycocon jugate DNA ligands comprising 
a backbone structure of calicheamicin are believed preferred for 
inhibition of NFAT-dependent transcription and sequence-selective 
binding to NFAT recognition sites. 

25 A preferred class of such calicheamicin-related 

glycoconjugate DNA ligands have the general structural formula 
(Structure I) : 
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where Is hydrogen , halogen > hydroxy , lower alky 1 lower 
alkoxy, or aryloxy, preferably methyl, ethyl, propyl, butyl, 
methoxy, ethoxy, propyloxy, phenoxy, or benzyloxy; R2 is a 
halogen, hydroxy, or lower alkyl, preferably lodo, bromo, chloro, 
5 fluoro, methyl, or ethyl; R3 Is lower alkyl or lower alkoxy, 
preferably methyl, ethyl, methyloxy, or ethyloxy, or hydrogen; 
and R4 and R5 are Independently selected from lower alkyl or 
lower alkoxy, preferably methyl, ethyl, .methyloxy, ethyloxy, or 
hydrogen. It Is believed that, in alternative embodiments, the 

10 R^ substituent may comprise any of a variety of non-interfering 
substituents (e.g., alkyl, aryl, arylaUcyl, aryloxy, substituted 
alkoxy, alkanol, halogen, heterocyles, - or hydrogen). The 
ethylamino group in Structure I and derived subclasses can be 
optionally replaced by R^-N-R^, wherein R5 and R7 are hydrogen, 

15 lower alkyl, or alkyl. 

A particularly preferred class of calicheamicin-related 
glycocon jugate DNA ligands comprise the following structural 
formula (Structxire II) : 
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wher is a non-lnterferlng substltuent, preferably hydrogen, 
hydr xy, lower alkyl, 1 wer alk xy, aryloxy, or allcanol, more 
preferably methyl , ethyl , propyl , butyl , methoxy , ethoxy , 
propyloxy, or ethylhydroxyl; and is a halogen, hydrogen, lower 
5 alkyl or lower alkoxy, preferably iodo, bromo, fluoro, chloro, 
methyl, ethyl, or methoxy. Typically, R^ is hydroxy, methoxy, 
or ethoxy. Typically, R2 is iodo, bromo, chloro, or fluoro; most 
typically iodo. 

A preferred subclass of calicheamicin-related 
10 glycocon jugate DNA ligands have the structural formula (Structtire 
III) : 




CHgO g 



Where R^ is a non-interfering substituent, preferably hydrogen, 
hydroxy, lower alkyl, lower alkoxy, aryloxy, or alkanol, more 
preferably methyl, ethyl, propyl, butyl, methoxy, ethoxy, 
15 propyloxy, hydroxy, halogen, or hydrogen. 

An especially preferred species of Strxicture I, 
Structure II, and Structure III is the methyl glycoside 
derivative of the aryltetrasaccharide, calicheamicin H6 (CIiM-*H6) : 
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Calicheamicin HG exhibits sequence-selective binding to DNA 
(e.g., pr fer ntially binding t TCCT, TTTT) and siibstantially 
lacks ONA cl avage activity. Calicheaaicin KG lacks the en diyne 
core Boiety present in calicheamicin yi^ that forms a reactive 
5 biradical species upon bioreductlon (e.g., with NADPH) ; thus, 
calicheamicin MG sxibstantially lacks DNA cleavage activity but 
exhibits reversible non-covalent binding to DNA in a sequence- 
selective manner. Calicheamicin HG and other sequence-selective 
congeners according to Structures I, II, and III can be used to 

10 displace, disrupt, or prevent binding of a DNA-binding protein, 
such as a sequence-specific transcription factor (e.g., NFAT) , 
to their cognate DNA recognition sites which comprise a sec[uence 
(e.g., TCCT) to which calicheamicin MG (or other calicheamicin- 
related glycocon jugate DNA ligand of Structure I, II, or III) 

15 preferentially binds . These calicheamicin-related glycocon jugate 
DNA ligands do not contain reactive enediyne moieties and are 
thus suited for vsirious uses, including as pharmaceuticals and 
commercial research reagents. 

The sequence-selectivity of the calicheamicin-related 

20 glycocon jugate DNA ligands of the invention can vary on the basis 
of the primary structure of the glycocon jugate DNA ligand. This 
provides an advantageous basis for choosing non-reactive DNA- 
binding agents which possess reversible sequence-selective 
binding to a particular predetermined DNA sequence. For example, 

25 calicheamicin MG preferentially to the TCCT and TTTT sequences 
which are present in, for example, an NFAT recognition site. 
Calicheamicin MG acts as a reversible ligand for binding to the 
TTTT and/or TCCT sequence (s) in an NFAT recognition site and 
inhibits binding of an NFAT complex to its recognition site. 

30 This inhibition of binding of NFAT to its recognition site 
modulates transcription, typically by inhibiting NFAT-dependent 
transcription of NFAT-dependent structural genes (e.g. , the human 
IL-2 gene) . This advantageous feature of calicheamicin MG 
provides the basis for its use as a modulator of the 

35 transcription of NFAT-dependent genes both in vivo and in vityg, 
emd for the use of calicheamicin MG as a commercial research 
reagent for the elution of preparative quantities of NFAT 
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prot in(s) from DMA affinity chromatography resin employing NFAT 
recognition sites in the DNA, among other uses. 

Sequence-Selective GlycocQn-)uaate DNA Liaands 
5 Structural congeners of esperimicln and calicheamicin 

which lack reactive enediyne moieties can . by synthesized 
according to conventional synthetic methods known to those 
skilled in the art, as well as other methods provided herein. 
Preferred stiructural congeners of calicheamicin are those having 

10 a structural formula of Structure I, II, or III. Other 
structural congeners comprise, for example, the basic structiire 
of calicheamicin H6 or Structure I, II, or III but which lack: 
(1) the E sugar ring, (2) the D sugar ring, or (3) both the D and 
E sugar rings. Additional structural congeners comprising the 

15 basic structure of CLM-MG or of staructures I, II, or III, but 
which have been multlmerlzed in varying combinations (e.g., 
strcture I linked to structure II, etc.) may be used. 

The degree of sequence-selective binding exhibited by 
each such structural congener is determined by assay, wherein a 

20 predetermined DNA-blndlng protein (preferably a transcription 
factor) and at least one polynucleotide comprising a known 
recognition sequence for the predetermined DNA-blndlng protein 
are contacted under binding conditions in the presence of the 
structural congener. Typically, the binding reaction also 

25 contains at least one alternative species of DNA-blndlng protein 
and at least one alternative polynucleotide having a cognate 
recognition site for the alternative DNA-blndlng protein; the 
predetermined DNA-blndlng protein and the alternative DNA-blndlng 
proteln(s) bind to distinct DNA sequences. Optionally, the 

30 alternative DNA-blndlng protein (s) and the cognate alternative 
DNA recognition site(s) may comprise a separate, parallel binding 
reaction. A control set of binding reactions lacking the 
structural congener are included for calibration of binding 
conditions. 

35 Usually, the polynucleotide comprising the recognition 

slte(s) for the predetermined DNA-binding protein is 
distinguishable from the polynucleotide comprising the 
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alternative DNA-binding protein recogniti n site(s) (e.g., th y 
may be of different lengths and electrophoretically separable) • 
Usually, ne or more of the polynucleotides is/ are labell d with 
a detectable label (e.g./ ^^P, ^^S) and/ or one or more of the 
5 DNA-binding proteins (the predetermined DNA-binding protein 
and/ or an alternative DNA-binding protein) is/are also labeled 
with a detectable label (e.g., radioiodination with ^^^I or ^^^I 
by chloramine T/Bolton-Hunter reaction, radiolabeling by 
incorporation of ^^S-methionine, ^^C-leucine, and the like) . The 

10 differential ability of the structural congener to inhibit or 
displace binding of the predetermined DNA-binding protein to its 
cognate recognition sec[uence as compared to the structural 
congener's ability to inhibit or displace binding of the 
alternative DNA-binding protein (s) to its/their cognate DNA 

15 recognition site(s) is determined. Various methods for 
performing such a determination of differential binding 
inhibition are available. 

Binding assays generally take one of two forms: 
immobilized target DNA having the recognition sequence can be 

20 used to bind labeled DNA-binding protein (s) , or conversely, 
immobilized DNA*binding protein (s) can be used to bind labeled 
target DNA comprising the cognate recognition sec[uence(s) . In 
each case, the labeled macromolecule (protein or DNA) is 
contacted with the immobilized macromolecule (respectively, DNA 

25 or protein) under aqueous conditions that permit specific binding 
of the DNA-binding protein (s) to the target DNA in the absence 
of the structural congener (glycoconjugate DNA ligemd) . 
Particular aqueous conditions may be selected by the practitioner 
according to conventional methods, including methods employed in 

30 DNA-protein footprinting and/or in vitro nuclear run-on 
transcription. For general guidance, the following buffered 
aqueous conditions may be used: 20-200 mM NaCl, 5-50 mM Tris HCl, 
pH 5-8, and nanomolar to millimolar concentrations of Zn"*"^ and, 
optionally, Mg"^^ and/or Mn*^. it is appreciated by those in the 

35 art that additions, deletions, modifications (such as pH) and 
substitutions (such as KCl substituting for NaCl or buffer 
substitution) may be made to these basic conditions. 
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Modifications can be made to the basic binding reaction 
conditions so 1 ng as specific binding of the DNA-binding 
protein (s) to target DNA occxirs in the control reaction (s). 
Conditions that do not permit specific binding in control 
5 reactions (no glycocon jugate DNA ligand included) are not 
suitable for use in DNA binding assays. . Conventional 
electrophoretic "mobility shift" assays (Fried M and Crothers D 
(1981) Nucleic Acids Res, £: 6505; Gamer M and Rezvin A (1981) 
Nucleic Acids Res> 9: 3407; Strauss F and Varshavsky A (1984) 
10 21* 889) may be used to calibrate the binding reaction 

conditions for satisfactory binding of reagents. However, it is 
believed that assays using an immobilized component (DNA or DNA- 
binding protein) will be preferred for most large--8cale screening 
applications. 

15 In embodiments where target DNA containing a DNA- 

binding protein recognition site is immobilized, preferably 
double-stranded DNA containing at least one recognition site 
sequence is bonded, either covalently or noncovalently, to a 
substrate. For example, but not for limitation, DNA can be 

20 covalently linked to a diazotized substrate, such as diazotized 
cellulose, particularly diazophenylthioether cellulose and 
diazobenzyloxymethyl cellulose (Alwine et al. (1977) Proc. Natl> 
Acad, sci . (JJ.S.A.\ H: 5350; Reiser et al. (1978) Biochem. 
Biophva , Res , Commun . J|£: 1104 ; Stellwag and Dahlberg (1980) 

25 Nucleic Acids Res. £: 299, which are incorporated herein by 
reference) . Alternatively, DNA can be covalently linked to a 
substrate by partial ultraviolet light-induced crosslinking to 
a Nylon 66 or nitrocellulose substrate (Church and Gilbert (1984) 
Proc. Natl. Acad. Sci. (U.S.A. ^ 81 : 1991, which is incorporated 

30 herein by reference) . Also, for example and not for limitation, 
DNA can be noncovalently bound to a Nylon 66 or other highly 
charged anionic sxibstrate (Berger and Kimmel, Methods in 
Enzvmoloav. Volume 152. Guide to Molecular Cloning Techniques 
(1987), Academic Press, Inc., San Diego, CA) . In some 

35 embodiments, it is preferable to use a linker or spacer to reduce 
potential steric hindrance from the siibstrate. The immobilized 
DNA is contacted with the labeled DNA-binding protein under 
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aqu us binding c ndltlons. 

Preferably, at leas-t one DNA-binding protein species 
is labeled with a detectable marker. Suitable labeling includes, 
but is not limited to, radiolabellng by incorporation of a 
5 radiolabeled amino acid (e.g., ^^C-labeled leucine, ^H-labeled 
glycine, ^^S-labeled methionine), radiolabellng by post* 
translational radioiodination with ^^^I or ^^^I (e.g., Bolton- 
Hxmter reaction and chloramine T) , labeling by post-translatlonal 
phosphorylation with ^^P (e.g., phosphorylase and Inorganic 

10 radiolabeled phosphate) fluorescent labeling by Incorporation of 
a fluorescent label (e.g., fluorescein or rhodamine) , or labeling 
by other conventional methods known in the eurt. Isotopes of Zn 
(Mn or Mg) may also be used, such as for zinc-finger proteins, 
and the like. In embodiments where the target DNA is immobilized 

15 by linkage to a substrate, the predetermined DNA*blnding protein 
(and the alternative DNA-binding protein) is labeled with a 
detectable marker. 

Additionally, in some embodiments a DNA-bindlng protein 
may be used in combination with an accessory protein (e.g., a 

20 protein which forms a transcription complex with the DNA-bindlng 
protein in vivo ) , it is preferred that different labels are used 
for each polypeptide species , so that binding of individual 
and/or heterodlmerlc and/or multimeric complexes to target DNA 
can be distinguished, as can binding of the alternative DNA- 

25 binding protei'n(s) , if present in the same binding reaction. For 
example but hot limitation, a predetermined DNA-bindlng protein 
may be labeled with fluorescein and an accessory polypeptide (or 
an alternative DNA-bindlng protein) may be labeled with a 
fluorescent marker that f luorescesces with either a different 

30 excitation wavelength or emission wavelength, or both. 
Alternatively, double-label scintillation counting may be used, 
wherein the predetermined DNA-bindlng protein is labeled with one 
isotope (e.g., ^H) and a second polypeptide species (accessory 
protein or alternative DNA-bindlng protein) is labeled with a 

35 different isotope (e.g. , ^"^C) that can be distinguished by 
scintillation counting using discrimination techniques. 

Labeled DNA-bindlng proteins are contacted with 
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ism blllzed target DNA under aqueous conditions as described 
infra • The tiae and temperature of incubation of a binding 
reaction Biay be varied, so long as the selected conditions permit 
specific binding to occur in a control reaction where no agent 
5 is present. Preferable embodiments employ a reaction temperature ' 
of about at least 15 degrees Centigrade, more preferably 35 to 
42 degrees Centigrade, and a time of incubation of approximately 
at least 15 seconds, although longer incubation periods are 
preferable so that, in some embodiments, a binding equilibrium 

10 is attained. Binding kinetics and the thermodynamic steUaillty 
of bound protein: DNA complexes determine the latitude available 
for varying the time, temperature, salt, pH, and other reaction 
conditions. However, for any particular embodiment, desired 
binding reaction conditions can be calibrated readily by the 

15 practitioner using conventional methods in the art, which may 
include binding analysis using Scatchard analysis. Hill analysis, 
and other methods (PyotQl^s, stg^ctuy^s aii4 MolQcuI.qr yyjti>cj,pl^g, 
(1984) Creighton (ed.), W.H. Freeman and Company, New York). 

Specific binding of labeled DNA-binding protein to 

20 immobilized DNA is determined by including unlabeled competitor 
protein (s) (e.g., albumin) and/or free competitor DNA or 
competitor oligonucleotides. After a binding reaction is 
completed, labeled DNA-binding protein (s) that is/are 
specifically bound to immobilized target DNA is detected. For 

25 example and not for limitation, after a suitable incubation 
period for binding, the aqueous phase containing non- immobilized 
protein and nucleic acid is removed and the substrate containing 
the target DNA and any labeled protein bound to the DNA is washed 
with a suitable buffer, optionally containing unlabeled blocking 

30 agent (s), and the wash buffer (s) removed. After washing, the 
amount of detectable label remaining specifically bound to the 
immobilized DNA is determined (e.g., by optical, enzymatic, 
autoradiographic, or other radiochemical methods) . 

In some embodiments , addition of unlabeled blocking 

35 agents that inhibit non-specific binding are included. Examples 
of such blocking agents include, but are not limited to, the 
following: calf thymus DNA, salmon spem DNA, yeast RNA, mixed 
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sequence (ramdom or pseudorandom sequence) lig nucleotides of 
various lengths, bovine serum albumin, nonionic detergents (NP- 
40, Tween, Triton X-lOO, etc.)r nonfat dry milk proteins, 
Denhardt^s reagent, polyvinylpyrrolidone, Ficoll, and other 
5 blocking agents. Practioners may, in their discretion, select 
blocking agents at suitable concentrations to be included in DNA 
binding assays; however, reaction conditions are selected so as 
to permit specific binding between the predetermined DNA-*binding 
protein and target DNA containing its recognition sequence in a 

10 control binding reaction. Blocking agents are included to 
inhibit nonspecific binding of labeled DNA-blnding protein to 
isnaobilized DNA and/or to inhibit nonspecific binding of leQseled 
DNA to Immobilized DNA-binding protein. 

In embodiments where the DNA-binding protein is 

15 immobilized, covalent or noncovalent linkage to a substrate may 
be used. Covalent linkage chemistries include, but are not 
limited to, well-characterized methods known in the art (Kadohaga 
and Tijan (1986) pypg, y^Ut Acad, Spj. fV.gtAt) 5889, which 
is incorporated herein by reference) . One example, not for 

20 limitation, is covalent linkage to a substrate derivatized with 
cyanogen bromide (such as CNBr-derivatized Sepharose 43} It may 
- be desirable to use a spacer to reduce potential steric hindrance 
from the substrate. Noncovalent bonding of proteins , to a 
substrate include, but are not limited to, bonding of the protein 

25 to a charged surface and binding with specific antibodies. DNA 
is typically labeled by incorporation of a radioleUaeled 
nucleotide (^H, ^*C, ^^S, or a biotinylated nucleotide that 

can be detected by labeled avidin (e.g. , avidin containing a 
fluorescent marker or enzymatic activity) • 

30 Frequently, a gel mobility shift assay is performed 

wherein the bound complex comprising the predetermined DNA- 
binding protein and the polynucleotide comprising the 
predetermined recognition site can be distinguished from unbound 
polynucleotide comprising the predetermined recognition site; 

35 also usually the bound complex comprising the alternative DNA- 
binding protein and the polynucleotide comprising the cognate 
alternative recognition site can be distinguished from unbound 
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polynucleotide comprising the alternative rec gnition site^ 
Binding reactions comprising either no glycoconjugate DNA ligand 
(control) or varying concentrations of the structural cong ner 
are incubated and electrophoresed. Glycoconjugate DNA ligands 
5 that selectively inhibit or displace binding of the predetermined 
DNA-binding protein to its recognition site will reduce the 
relative abundance of the bound complex comprising the 
predetermined DNA-binding protein and the polynucleotide 
comprising the predetermined recognition site (a retarded 

10 mobility band) , pref ereO^ly in a concentration^dependent 
relationship. Preferably, the glycoconjugate DNA ligand 
substantially will not displace or inhibit binding of the 
alternative DNA-binding protein ( s ) to the polynucleotide ( s ) 
comprising the cognate alternative recognition site(6) at the 

15 lowest concentrations of glycoconjugate DNA liigand which produces 
detectable inhibition of binding of the predetermined DNA-binding 
protein to its recognition site. 

Novel sequence-selective glycoconjugate DNA ligands 
that bind preferentially to a specific DNA sequence can be 

20 identified by screening a library of glycogen jugate DNA ligands 
with a predetermined DNA sequence. For example, such a method 
generally comprises the steps of (1) generating a library of 
glycoconjugate DNA ligands of varying structural complexity by 
mixed chemical synthesis techniques known in the art, (2) 

25 attaching the library members to or synthesizing the library 
members directly on a solid phase support (e.g., beads comprising 
polystyrene crosslinked with 1 percent divinylbenzene) , such that 
each unic[ue ligand (library member) is present on discrete solid 
support or is otherwise spatially defined, (3) contacting the 

30 glycoconjugate DNA ligand library with a labeled (e.g., with a 
radiolabel or fluorescent label) polynucleotide comprising a 
predetermined sequence (e.g., a recognition sequence to which a 
predetermined sec[uence-selective DNA-binding protein binds) , (4) 
identifying library members that selectively bind to (e.g., 

35 preferentially bind as compared to competitor random-secfuence 
DNA, if included) the labeled polynucleotide, and (5) optionally 
determining the structure of the library member glycoconjugate 
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DNA llgand bound to the labeled polynucleotide. Solid phase 
oligosaccharide synthesis has been described (Danishefslcy et al. 
Science iJSfl: 1307, incorporated her in by reference) and may be 
used to generate combinatorial glycocon jugate libraries on solid 
5 substrates. 

DNA sequences that bind to de novo synthesized 
glycocon jugate compounds that lack a predetermined DNA 
recognition site can be identified by screening a library of DNA 
sequences with a unique glycocon jugate ligand. Such methods 

10 generally comprise the steps of: (1) synthesizing a 
glycocon jugate ligand (e.g., according to a structural formulae 
provided herein), (2) attaching the glycocon jugate ligand to or 
synthesizing directly on a solid phase support, (3) contacting 
the glycocon jugate ligand with a library of polynucleotides 

15 comprising a variable region sequence of r€mdom, pseudorandom, 
or defined set sequences flanked by predetermined constant region 
of specific sequence (s) suitable for hybridizing to a PGR 
primer(s), such contacting is. performed under suitable binding 
conditions (e.g. , physiological conditions) , (4) selecting the 

20 polynucleotides botind to the immobilized glycocon jugate ligand, 
(5) amplifying the selected polynucleotide sequences (e.g., by 
PGR), and (6) identifying the selected variable secpiences by 
sequencing the PGR product(s). 

25 yranscription Assays 

Glycocon jugate DNA ligands can inhibit (or otherwise 
modify or modulate) the function of DNA-binding proteins that are 
transcription factors (e.g., NFAT, HNF-1, OCT-l/OAP) by 
interfering with their binding to their recognition DNA 

30 sequences, preferably withbut substantial interference with the 
function of other transcription factors or normal cell 
physiology. If the specificity of transcriptional inhibition is 
sufficiently high, such that transcription of genes dependent on 
one particular transcription factor (e.g., NFAT) are 

35 differentially inhibited to a sufficiently greater extent than 
genes having transcription dependent on essential or 
"housekeeping" transcription factors (e.g., Spl, API), then the 
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differential inhibition of transcription may be exploited for 
therapeutic gene modulation, such as for m dical treatment or 
vet rinary use (e.g., for inducing or repressing transcription 
of selected transgenes under the transcriptional control of a 
5 transcription factor recognition sequence in transgenic animals) « 
Thus, the potency, seguence'-^selectivity, and efficacy 
of a glycoconjugate DNA ligand to differentially inhibit 
transcription driven by one particular transcription factor (or 
subgroup of transcription factors) can be readily evaluated using 

10 transcription assays. 

Structural congeners which exhibit sequence-selective 
binding to a sequence present in a predetermined transcription 
factor recognition sequence can be further evaluated for their 
abilities to specif ically modulate (e.g., inhibit) transcription 

15 of a polynucleotide sequence operably linked to a transcriptional 
regulatory sequence under the transcriptional influence of the 
predetermined transcription factor recognition sequence . In 
vitro transcription reactions may be used, if available, to 
ascertain the ability of a structural congener to modulate (e.g., 

20 inhibit) the transcription of a template imder the 
transcriptional influence of a cis-^linked predetermined 
transcription factor recognition sequence in the presence of the 
predetermined transcription factor (and other requisite 
transcription reaction components) , wherein the modulation 

25 (inhibition) is determined relative to a control transcription 
reaction lacking the structural congener. Usually, one or more 
alternative transcription factors and polynucleotide sequences 
under the transcriptional influence of their cognate alternative 
recognition sequence (s) are assayed in parallel transcription 

30 reactions or in a single transcription reaction containing both 
the predetermined and the alternative DNA-binding proteins and 
recognition sequences. 

Alternatively, or in combination with in vitro 
transcription assays, in vivo transcription assays may be 

35 performed. Eukaryotic or prokaryotic cells, preferably 
mammalian, insect, bacterial {e.g.,\F. coli) , yeast, or plant 
cells, are employed as reporter host cells. Reporter host cells 
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comprise a reporter polynucleotide sequence (e.g., luciferase, 
i9-galact08idase, IL-2, or other d tectable sequence) under the 
transcripti nal inf lu nee of an operably linked cis-acting 
transcription factor recognition sequence (e.g., an NFAT 
5 recognition site) . Generally, the reporter host cells are stably 
or transiently transf ected with one or more species of reporter 
polynucleotide according to any of the various methods and 
vectors known in the art. Preferably, the reporter host cells 
comprise a reporter polynucleotide under the transcriptional 

10 control of a predetermined transcription factor recognition 
sequence (e.g., NFAT site) and one or more alternative reporter 
polynucleotide (s) xuider the transcriptional control of 
alternative transcription factor recognition site(6) (e.g., Spl, 
API sites) , wherein the alternative reporter polynucleotide (s) 

15 produce a trsmscribed signal (e.g., mRNA sequence, enzyme, etc.) 
that is distinguishable from the transcribed signal of the 
predetermined reporter polynucleotide. 

Reporter host cells are cultured iinder suitable cultxire 
conditions for driving transcription of the linked reporter 

20 sequences under the Influence of the linked transcription factor 
recognition sequence (s) (e.g., NFAT-dependent transcription is 
assayed in stimulated T lymphocyte lines) . A glycocon jugate DNA 
ligand is added to the cell cultures in varying concentrations 
or dosages (i.e., for establishing a dose--response relationship) 

25 and the ability of the glycocon jugate DNA ligand to 
preferentially or specifically modulate (e.g., inhibit) the 
transcriptional activity of the predetermined reporter 
polynucleotide as compared to the alternative reporter 
polynucleotide (s) is determined. Structural congeners which 

30 preferentially or selectively inhibit transcription of the 
predetermined reporter polynucleotide as compared to the 
alternative reporter polynucleotide (s) are thereby identified as 
selective transcriptional antagonists. Preferably, such 
selective transcriptional antagonists do not produce siibstantial 

35 adverse effects (e.g., decreased cell viability, neoplastic 
transformation, expression of undesirable phenotypes) on the 
cultured cells at the lowest dose levels required for detectable 
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selective inhibition of treuiscription f the predetermined 
r porter polynucleotide. Generally, sel ctive transcriptional 
antagonists are candidate pharmaceuticals f r therapeutic 
transcriptional modulation of endogenous genes under the 
5 transcriptional influence of a cis-acting predetermined 
transcription factor recognition sequence (e.g.., NFAT-dependent 
genes) . 

Selective transcriptional antagonists are administered 
to nonhiiman animals (e.g., mice, rats, rabbits, hamsters, 

10 nonhuman primates) to determine safety, efficacy, and selectivity 
of transcriptional modulation of the desired endogenous genes in 
the desired cell type and organs. The therapeutic effect of 
treating disease conditions by such gene modulation is evaluated 
and maximum tolerated dose, LDgo^ and a dose-response curve is 

15 generated to calibrate efficacious dose levels and routes of 
administration. In one variation, calicheamicin MG is employed 
to selectively inhibit transcription of NFAT-dependent genes and 
thereby inhibit T cell activation and effect immunosuppression 
of cell-mediated immune response ' (e.g. , allograft rejection) In 

20 YiVQ> 

Commercial Research Reagent 
In addition to their utility as therapeutic agents for 
treating hximan diseases, «glycpcon jugate DNA ligands, such as the 

25 use of calicheamicin MG and related structural congeners for 
selective antagonism of NFAT-dependent gene transcription and 
immxinosuppression, this glycocon jugate DNA ligands will find use 
as commercial reagents. 

For example and not limitation, NFAT can be purified 

30 in preparative quantities by methods wherein a purification step 
involves affinity chromatography of a sample containing NFAT 
(e.g., an extract from activated T lymphocytes) on a matrix 
containing DNA sequences comprising NFAT binding sites to which 
calicheamicin MG selectively binds. The sample containing NFAT 

35 is contacted with the affinity matrix under binding conditions 
permitting the binding of NF-At to its DNA recognition site(s) 
on the matrix DNA. Unbound sample is removed, and the matrix is 
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ptlonally washed with a suitable wash buffer. As an enrichment 
step, a solution comprising an effective concentration ( r 
concentration gradient) of calicheamicin MG is applied to th 
bound matrix to selectively displace, and thus elute, NFAT from 
5 its recognition site(s) on the matrix DNA without eluting all 
DNA-binding proteins bound to the DNA matrix.. The eluate is 
enriched for NFAT. 

Alternatively, calicheamicin H6 may be used for 
titrating NFAT binding in gel shift assays, DNA footprinting, 
10 nuclease-protection assays, and the like. Many such assays may 
have diagnostic as well as research applications. 

It is envisioned that a significant commercial market 
will develop for calicheamicin MG as a research reagent (much as 
restriction endonucleases are widely sold for research and 
15 diagnostic uses), particularly in view of the desirability of 
using purified NFAT for transcription assays, binding assays, and 
the like for large-scale drug screening programs for novel 
immunosuppressants, and the like. 

20 Pharmaceutical Compositions 

Glycocon jugate DNA ligands, such as the non-cytotoxic 
calicheamicin-related compounds of Structures I, II, and III, and 
calicheamicin MG, can be used as pharmaceuticals for effecting 
therapeutic and/ or prophylactic transcriptional modulation of 

25 predetermined genes In vivo . For example, calicheamicin MG can 
act as an immunosuppressant by selectively inhibiting NFAT- 
dependent transcription of T cell activation genes (e.g., IL-2) . 
Calicheamicin MG and other structural congeners of calicheamicin 
and esperimicin may exhibit selective agonism or antagonism of 

30 the expression of other genes, for example proto-oncogenes, 
oncogenes, tumor suppressor genes, genetic disease alleles, 
pathogen-related genes, and transgenes 

The preferred pharmaceutical compositions of the 
present invention comprise a therapeutically or prophylactically 

35 effective dose of at least one glycocon jugate DNA ligand, such 
as calicheamicin MG. 

Pharmaceutical comp sitions comprising a glycoconjugate 
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DMA ligand of th . present invention are useful for topical and 
parenteral administration, i>e, , subcutaneous ly, intreuauscularly 
or intraven usly. The finding that glycocon jugate DNA ligands 
possess sequence-specific transcriptional inhibition in vitro as 
5 well as In vivQ without substantial cytotoxicity indicates that 
calicheamicin-related glycocon jugate DNA ligands. are suitable for 
pharmaceutical use. The glycocon jugate DNA ligands of the 
invention are suitable for administration to mammals, including 
human patients and veterinary patients, 

10 The compositions for parenteral administration will 

commonly comprise a solution of a glycoconjugate DNA ligand or 
a cocktail thereof dissolved in an acceptable carrier, preferably 
an aqueous carrier. It is often preferable to include in the 
carrier a hydrophobic base (e.g. , polyethylene glycol, Tween 20) . 

15 A variety of aqueous carriers can be used, e>g. ^ water, buffered 
water, 0.4% saline, 0.3% glycine and the like. These solutions 
are sterile and generally free of particulate matter. These 
compositions may be sterilized by conventional, well known 
sterilization techniques. The compositions may contain 

20 pharmaceutically acceptable auxiliary substances as required to 
approximate physiological conditions such as pH adjusting and 
buffering agents, toxicity adjusting agents and the like, for 
example sodium acetate, sodium chloride, potassium chloride, 
calcium chloride, sodium lactate, etc. The concentration of the 

25 glycoconjugate DNA ligand(s) in these formulations can vary 
widely, i.e. . from less than about 1 nH, usually at least about 
O.lmH to as much as 100 mH and will be selected primarily based 
on fluid volximes, viscosities, etc., in accordance with the par- 
ticular mode of administration selected. Most usually, the 

30 glycoconjugate DNA ligand is present at a concentration of 0.1 
/xM to 10 mM. For example, a typical formulation for intravenous 
injection comprises a sterile solution of a non-cytotoxic 
glycoconjugate DNA ligand at a concentration of 5 mM in Ringer's 
solution. A hydrophobic vehicle may be used, or an aqueous 

35 vehicle comprising a detergent or other lipophilic agent (e.g., 
Tween, NP-40, PEG) ; alternatively, the glycoconjugate DNA 
ligand (s) may be administered as a suspension in an aqueous 
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carrier, or as an emulsl n* 

Thus, a typical pharmaceutical composition for 
intramusculeu: injection could be made up to contain l ml st rile 
buffered water, and about 1-100 mg of glycocon jugate DNA ligand. 
5 A typical composition for intravenous infusion can be made up to 
contain 250 ml of sterile Ringer's solution, and about 100-1000 
mg of glycocon jugate DNA ligand (s) . Actual methods for preparing 
parenterally administrable compositions will be known or apparent 
to those skilled in the art and are described in more detail in, 

10 for example. Remington's Pharmaceutical Science, 15th Ed., Hack 
Publishing Company, Easton, Pennsylvania (1980) , which is 
incoz^orated herein by reference. A typical pharmaceutical 
composition for topical application can be made with suitable 
dermal ointments, creams, lotions, ophthalmic ointments and 

15 solutions, respiratory aerosols, and other excipients. 
Excipients should be chemically compatible with the 
glycocon jugate DNA ligands that are the active ingredient (s) of 
the preparation, cuid generally should not increase decomposition, 
denaturation, or aggregation * of active ingredient (s) . 

20 Frequently, excipients will have lipophilic components such as 
oils and lipid emulsions. 

The transcriptional antagonist glycocon jugate DNA 
ligands of this invention can be lyophilized for storage and 
reconstituted in a suitable carrier prior to use. It will be 

25 appreciated by those skilled in the art that lyophilization and 
reconstitution can lead to varying degrees of activity loss, and 
that use levels may have to be adjusted to compensate. 

The compositions containing the present glycoconjugate 
DNA ligands (e.g., selective transcription antagonists} or 

30 cocktails thereof can be administered for prophylactic and/or 
therapeutic treatments. In therapeutic application, compositions 
are administered to a patient already affected by the particular 
disease, in an amount sufficient to cure or at least partially 
arrest the condition and its complications my modulating 

35 expression of one or more predetermined genes. An amount 
adequate to accomplish this is defined as a "therapeutically 
effective dose" or "efficacious dose." Amounts effective for 
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this use will depend up n the severity of the condition, the 
g neral state of the patient, and the r ute of administration, 
but generally range from about 1 mg to about lOg of 
glycocon jugate DNA ligand per dose, with dosages of from 10 mg 
5 to 2000 mg per patient being more commonly used. For example, 
for treating acute tissue graft rejection reactions, about 10 to 
1000 mg of a calicheamicin MG or a congener may be administered 
systemically by intravenous infusion* 

In prophylactic applications, compositions containing 

10 the glycocon jugate DNA ligands or cocktails thereof are 
administered to a patient not already in a disease state to 
enhance the patient's resistance or to retard the progression of 
disease. Such an amount is defined to be a **prophylactically 
effective dose," In this use, the precise amounts again depend 

15 upon the patient's state of health and general level of immunity, 
but generally range from 1 mg to 10 g per dose, especially 10 to 
1000 mg per patient. A typical formulation of a glycocon jugate 
DNA ligand used as a selective transcriptional antagonist, such 
as calicheamicin HG, will contain between about 25 and 250 mg of 

20 the glycocon jugate in a unit dosage form. 

Single or multiple administrations of the compositions 
can be ceurrled out with dose levels and dosing pattern being 
selected by the treating physician. In any event, the 
pharmaceutical formulations should provide a quantity of the 

25 glycocon jugate DNA ligand (s) of this invention sufficient to 
effectively treat the patient. Typically, at least one species 
of glycocon jugate DNA ligand is administered as the sole active 
ingredient, or in combination with one or more other active 
ingredients. In general for treatment of gene-expression related 

30 disease (e.g, autoimmiinity, neoplasia, senescence, congenital 
genetic disease) , a suitable effective dose of the glycocon jugate 
transcriptional modulator will be in the range of 0.01 to 1000 
milligram (mg) per kilogram (kg) of body weight of recipient per 
day, preferably in the range of 1 to 100 mg per kg of body weight 

35 per day. The desired dosage is preferably presented in one, two, 
three, four or more subdoses administered at appropriate 
intervals throughout the day. These subdoses can be administered 
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as xinit dosage forms, for example, containing 5 to 10,000 xng, 
pr ferably lO to lOOO mg of active ingredient per unit dosage 
form. 

Once detectable improvement of the patient *s conditions 
5 has occurred, a maintenance dose is administered if necessary. 
Subsequently, the dosage or the frequency of administration, or 
both, can be reduced, as a function of the symptoms, to a level 
at which the improved condition is retained. When the symptoms 
have been alleviated to the desired level, treatment can cease. 

10 Patients can, however, require intermittent treatment on a long- 
term basis upon any recurrence of the disease symptoms or as a 
prophylactic measure to prevent disease symptom recurrence. The 
composition used in these therapies can be in a variety of forms. 
These include, for example, solid, semi-solid and liquid dosage 

15 forms, such as tablets, pills, powders, liquid solutions or 
suspensions, liposome preparations, injectable and infusible 
solutions. The preferred form depends on the intended mode of 
administration and therapeutic application. Typically, a sterile 
solution of a glycooon jugate DNA ligand in an aqueous solvent 

20 (e.g., saline) will be administered intravenously. The 
compositions also preferably include conventional 
pharmaceutically acceptable garriers and adjuvauits which are 
known to those of skill in the art. See, e.g.. Remington's 
Pharmaceutical Sciences . Mack Publishing Co.: Easton, PA, 17th 

25 Ed. (1985) . Generally, administration will be by oral or 
parenteral (including subcutaneous, intramuscular, intravenous, 
and intradermal) routes, or by topical application or infusion 
into a body cavity, or as a bathing solution for tissues during 
surgery. 

30 For solid compositions, conventional nontoxic solid 

carriers can be used which include, for example, pharmaceutical 
grades of mannitol, lactose, starch, magnesiiim stearate, sodium 
saccharin, talcum, cellulose, glucose, sucrose, magnesium 
carbonate , and the like . For oral administration , a 

35 pharmaceutically acceptable nontoxic composition is formed by 
incoirporating any of the normally employed excipients, such as 
those carriers previously listed, and generally 0.001-95% of 
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active Ingredient, preferably about 20%. It should, of 

course, be understood that the methods of this Invention can be 
used In coinblnation with other agents that have gene expression 
modulatory activity and/ or other Immunosuppressants (e.g., 
5 cyclosporin A, FK506) . 

While It Is possible to administer the active 
Ingredient of this Invention alone, It Is believed preferable to 
present It as part of a pharmaceutical formulation. The 
formulations of the present Invention comprise at least one 

10 compound of this Invention In a therapeutically or 
pharmaceutlcally effective dose together with one or more 
pheunnaceutlcally or therapeutically acceptaO^le carriers and 
optionally other therapeutic Ingredients. Various considerations 
are described, e.g.. In Gllman et al. (eds) (1990) Goodman and 

15 Oilman's! The Pharmacological Bases of Therapeutics. 8th Ed. , 
Pergamoh Press; and Remington ' s supra, each of which is hereby 
Incorporated herein by reference. Methods for administration are 
discussed therein, e.g., for oral. Intravenous, intraperitoneal, 
or Intramuscular administration, and others. Pharmaceutlcally 

20 acceptable carriers will include water, saline, buffers, and 
other compounds described, e.g., in the Merck Index, Merck & Co., 
Rahway, NJ, incorporated herein by reference. 

A more recently devised approach for parenteral 
administration employs the Implantation of a slow-release or 

25 sustained-release system, such that a constant level of dosage 
is maintained. See, e.g., U.S. Patent No. 3,710,795, which is 
incorporated herein by reference. Glycocon jugate DNA ligands may 
be administered by transdermal patch (e.g., iontophoretic 
transfer) for local or systemic application. 

30 Kits can also be supplied for use with the subject 

glycocbnjugates for use in the protection against or therapy for 
a disease. Thus, the svibject composition of the present 
invention may be provided, usually in a lyophilized form or 
aqueous solution in a container, either alone or in conjunction 

35 with additional non-cytotoxlc glycocon jugate DNA ligands of the 
desired type. The non-cytotoxlc glycocon jugate DNA ligands are 
included in the kits with buffers, such as Trls, phosphate, 
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carbonate, etc., stabilizers, bi cides, inert proteins, e.g. , 
senm albumin, or the like, and a set f instructions for use. 
Generally, these materials will be present in less than about 5% 
vt. based on the amount of glycocon jugate, and usually present 
5 in total amount of at least about 0.001% based again on the 
concentration. Frequently, it will be desirable to include an 
inert extender or excipient to dilute the active ingredients, 
where the excipient may be present in from about 1 to 99.999% wt. 
of the total composition. 

10 

lj\ yjtyo and Research Admih^styaj^Aon 
In another aspect of the invention, calicheamicin- 
related glycocon jugate DNA ligand(s) of the invention are 
employed to modulate the expression of naturally-occurring genes 

15 or other polynucleotide sequences under the transcriptional 
control of a predetermined transcription regulatory element 
comprising a transcription factor binding site (e.g., an NFAT 
site). Transgenes, homologous recombination constructs , and 
episomal expression systems (e.g., viral-based expression 

20 vectors) comprising a polynucleotide sequence under the 
transcriptional control of one or more transcription factor 
binding site linked to a promoter will be made by those of skill 
in the art according to methods and guidance available in the 
art, as will transformed cells smd transgenic nonhuman animals 

25 harboring such polynucleotide constructs. The transcriptional 
antagonist (or agonist) glycocon jugate DNA ligands may be used 
to modulate the transcription of predetermined transcription 
factor-regulated polynucleotide sequences in cell cultures (e.g. , 
ES cells) and in intact animals, particularly in transgenic 

30 animals wherein a transgene comprises one or more predetermined 
transcription factor recognition sites as transcriptional 
regulatory sequences. For transformed or transgenic cell 
cultures, a dose-response curve is generated by titrating 
transcription rate of the recognition site-controlled 

35 polynucleotide sequence against increasing concentrations of 
selectively binding glycocon jugate DNA ligand, which will reduce 
the transcription rate reliant upon binding of the predetermined 
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transcription factor to its recognition sit . Similar d se- 
r sponse titration can be performed in transgenic animals, such 
as transgenic mice, harb ring a predetermined transcription 
factor recognition site-controlled transgene sequence. 

5 

The broad scope of this Invention is best understood 
with reference to the following examples, which are not intended 
to limit the invention in any manner* The following examples are 
of fered by way of illustration, not by way of limitation. 

10 

EXPERIMENTAL EXAMPLES 

Oyervley 

The ordered regulation of gene expression that 
determines specific developmental programs, or cellular responses 

15 to physiologic stimuli is dependent on sequence-specific DNA- 
protein Interactions. This requisite specificity suggests a 
meems to control phenotyplc responses through the use of ligands 
that (1) bind DNA with a high degree of sequence specificity, 
(11) antagonize DNA-proteln interactions either by inhibition of 

20 DNA-proteln complex formation or by displacement of preformed 
complexes, and (ill) function In vivo * Several types of 
compounds have been shown to satisfy some of these criteria, 
including oligodeoxynucleotldes Involved in triple helix 
formation, peptide nucleic acids, and other nonlntercalating DNA 

25 ligands such* as netropsin or distamycln A. However, none of 
these adequately satisfy all the requirements for specific and 
biologically useful transcriptional antagonists. 

The enedlyne compound calicheamicin 71^ (CLM, Fig. 1)/ 
representative of a novel class of heterocyclic glycocon jugate 

30 DNA ligands, binds to and cleaves DNA at specific four base pair 
sequence. Affinity cleavage experiments performed by Zein et al. 
demonstrate that CLM preferentially cleaves DNA at cytosine- 
containing homopyrimidine tracts, preferentially TCCT. 
Additional studies, however, show that in the absence of a TCCT 

35 sequence the preferential site of cleavage is TTTT despite the 
presence of a cytosine-containing sequence TCCC. Thus, although 
CLM does exhibit preferential binding at the four base pair level 
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sugg stive of base specific interactions (i.e. direct readout), 
sequence-dependent DNA structural polymorphism and c nf ormational 
flexibility are also likely to significantly influence 
specificity (i.e. indirect readout). Comparisons of the DNA 
5 cleavage specificity of CLM derivatives indicate that the 
polysaccharide/ aromatic component is largely- responsible for 
sequence-specific DNA binding. 

To demoixstrate sequence-specific DNA-ligand 
interactions, the methyl glycoside derivative of the aryl 
10 tetrasaccharide of calicheamicin CLM (hereafter referred to as 
CLM-MG or calicheamicin H6) which lacks the enediyne component 
and therefore does not cleave DNA was synthesized. 

Synthesis of Calicheamicin MG 
Calicheamicin HG was synthesized according to the 
15 general method of Halcomb et al. (1992) Ana. Chem.. Int. Ed. 
Enal. 31 ; 338 and Halcomb et al. (1992) Ana. Chem. 104 ; 314. 

ATitaj,ysi3 of pyftr-pyotein I?)teractjLoi>s 
DNA footprinting analysis Indicated that CUf-MG 

20 exhibits DNA sequence specific binding that is similar to the 
parent compound, CLM. To determine if CLM-HG binding was capable 
of interfering with DNA-protein interactions, the affect of CLM- 
MG on the binding of several distinct DNA binding proteins in 
vitro was measured using a electrophoretic mobility shift assay. 

25 The transcription factors NFAT, API, Oct-1 and Oct-1 associated 
protein (OAF), NFkB, HNF-1, and Spl represent multimeric protein 
complexes that bind to specific DNA sequences found in the 5' 
regulatory regions of a variety of inducible, constitutive, and 
tissue-specific genes. These protein complexes are 

30 representative members of several distinct classes of 
transcription factors, including the basic-leucine zipper or bZIP 
class (API) , the rel/dorsal class (NF-kB) , the homeodomain class 
which contains a helix-turn-helix motif (NHFl, Oct-1), and the 
zinc finger class (Spl) . The constituent proteins of NFAT have 

35 not been completely characterized, but appear to consist, of a 
protein sharing some characteristics with API and an unidentified 
DNA-binding protein that undergoes cytoplasmic to nuclear 
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translocation upon T cell antigen receptor stimulation. 
Predict d CLH binding sites are present within four of the six 
DNA sequences to which these transcription fact rs bind (Fig. 2) . 
PurtheriDore, only the NFAT recognition sequence contains the 
5 presumed preferential CLM recognition site TCCT. 

Nuclear extracts containing the various DNA binding 
activities were prepared as previously described (Fiering et al, 
(1990) Genes Devel. ±z 1823, incorporated herein by reference) 
from TAg.Jurkat cells (a derivative of the human T cell leukemia 

10 cell line Jurkat transfected with the SV40 large T antigen 
(Northrop et al. (1993) J. Biol, Chem. 268 ; 2917) stimulated with 
2 MM ionomycin plus 20 ng/ml 12-0-tetradecanoyl-phorbol-13- 
acetate (TPA) for two hours (NFAT, API, Octl, NF-kB) , HeLa cells 
stimulated with 20 ng/ml TPA for 2 hours (API, Octl, NF-kB, Spl) , 

15 or MDCK cells (HNF-1) - Cells were maintained in RPMI-1640 
supplemented with 10% (vol/vol) heat-inactivated fetal calf 
serum, 100 U/ml penicillin, 100 Mg/ml streptomycin, 2 mH 
glutamine and 50 j3-mercaptoethanol (complete medium) in a 5% 
Cb2/95% air humidified atmosphere. CLM-MG was solubilized in 10% 

20 EtOH or DMSO. Electrophoretic mobility shift assays were 
performed by incubating varying concentrations of CLM-MG or the 
appropriate buffer control with 0.1 ng DNA probe (approx. 20,000 
cpm, see below) for 15 minutes at room temperature, followed by 
the addition of 2 ixL (5-10 Mg) nuclear extract, in a final volume 

25 of 15 fiL. The incubation was continued for 45 minuties at room 
temperature. The binding buffer consisted of 10 mM Tris-Cl 
(pH 7.6), 80 mM NaCl, 1 mM EDTA, 0.5 inM dithiothreitol, 5% 
glycerol, and 2.5 jtxg poly d(I-C) . Samples were subsequently 
loaded onto a 4% non-denaturing polyacrylamide gel in 0.5X TBE 

30 and electrophoresed at 180 volts for approximately 1.5-2 hours 
at room temperature. DNA probes (listed below) were labelled 
with ^2p-Y-ATP and polynucleotide kinase, followed by filling-in 
5« overhang with dNTPs and the Klenow fragment of DNA polymerase 
(as necessary) . Alternatively, labelling was performed by 

35 filling-in 5« overhangs with ^^p-a-dCTP, dATP, dTTP, dGTP and the 
Klenow fragment of DNA polymerase. Protein-DNA complex formation 
was measured directly using a radioanalytic imaging system (Ambis 
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Imaging, Xnc«)« The following ligonucleotides were utilized as 
probes for the gel shift assays: 



10 



15 



20 



25 



30 



35 



40 



NPAT 

API 

NFkB 

Octl 

Octl/OAP 

HNF-1 

Spl 



GATCTAAGGAG GAAAA ACTGTTTCATG 

ATTCCTCCTTTTTGACAAAGTACCTAG 

TCGAGTGACTCAGCGCG 

CACTGAGTCGCGCAGCT 

AGGGATTTCAC 
TCCCTAAAGTG 



TGTAATATGTAAAACATTT 
ATTATACATTTTGTAAAAC 

TTT GAAAA TAGTGTAATATGTA AAAC AT 

CTTTTATACACATTATACATTTTGTAAAA 

CAAACTGTCAAATATTAACTAAAGGGAG 
GTTTGACAGTTTATAATTGATTTCCCTC 

TCGAGGGGCGGGGC 

CCCCGCCCCGA6CT 



The presence of CLH-MG in DNA-protein binding reactions 
resulted in the complete inhibition of NFAT complex formation at 
100 mH CLM-HG. In contrast, there was no significant inhibition 
of API or Spl complex formation at concentrations of CLM-MG which 
significantly inhibited NFAT-DNA complex formation and minimal 
inhibition at concentrations as high as 100 /iM. NP-kB, Oct-1, 
and HNF-1 showed intermediate levels of inhibition of DNA-protein 
complex formation by CLM-MG (Fig. 3). The greater sensitivity 
of CAP to inhibition by CLM-MG as compared to Oct-1 is likely a 
reflection of the dependence of OAP binding on Oct-1 binding. 
These results indicate that CLM-MG is capable of inhibiting DNA- 
protein complex formation and are consistent with the result 
predicted based on the presence or absence of CLM binding sites 
within the transcription factor recognition sequences. 

Studies of the kinetics of CLM-MG inhibition of DNA- 
protein interaction showed that preincxibation of CLM-MG with DNA 
was not necessary for complete inhibition of NFAT-DNA complex 
formation. Furthermore, CLM-MG rapidly displaced preformed NFAT- 
DNA complexes, with 80% inhibition of NFAT-DNA complex formation 
observ d within 15 seconds of the addition of CLM-MG followed by 
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separation of bound and unbound DNA by non-^denaturing acxylamide 
gel el ctrophoresls. In contrast, addition of excess unlabeled 
DNA probe resulted in far less dissociation of the DNA-protein 
complex. The rapid dissociation of preformed DNA-protein 
5 complexes by CLH-MG indicates that sensitivity to inhibition by 
CLH-MG is not dependent on differences in DNA-protein complex 
dissociation rates. In addition, the similarity between the rate 
of dissociation of preformed NFAT-DNA induced by CLM-M6 at room 
temperatvire and the apparent rate at 4®C is consistent with the 

10 rate constant for the association of CLH-HG and DNA being 
diffusion-collision limited. 

The differential sensitivity of the NPAT-DNA, HNFl-DNA, 
and AFl-DNA complexes to disruption by CLH-M6 is shown in Fig. 
4, demonstrating that NFAT-DNA complexes are most sensitive to 

15 disiruption by CLM-H6. 

To provide further evidence that CLM-MG inhibits DNA- 
protein complex formation by binding to DNA in a sequence 
specific manner, alterations were made in the putative CLH-HG 
binding sites within the NFAT recognition sequence (Fig. 5) . 

20 Substitution of thymidine for guanine in the first guanine 
dinucleotide resulted in decreased sensitivity of the NFAT 
. complex to inhibition by CLH-HG (Figs. 5 and 6) . Alterations in 
the second guemine dinucleotide eliminated NFAT binding, 
consistent with the identification of these nucleotides as 

25 contact sites by methylation interference studies. The presence 
of overlapping CLM-MG binding sites within the NFAT recognition 
sequence gives rise to the prediction that increasing the 
distance between these sites may increase sensitivity to CLM-MG, 
since overlapping sites may result in decreased occupation of the 

30 site more directly involved in protein contacts. Comparison 
between the wild-type NFAT recognition sequence and an altered 
sequence containing an inserted adenine nucleotide shows that 
binding of NFAT to the later sequence is more sensitive to 
inhibition by CLM-MG (Fig. 6) . Cross^competition studies between 

35 the wild-type and altered DNA sequences demonstrate that NFAT 
binds to the altered DNA sequences with approximately equal 
affinity as the wild-type sequence. Thus, the extent to which 



W09SA)S389 



PCT/USW/09123 



54 

NFAT-DNA complex formation is inhibited by CLM-MG is dependent 
on the presence of CLM-HG binding sites, providing fiirther 
evidence demonstrating that CLM-MG functions to inhibit DNA- 
protein interaction by binding to DNA in a sequence specific 
5 manner. 

The converse experiment of generating CLM-MG 
sensitivity by introduction of a CLM-MG . binding site into a DNA 
protein recognition sequence that is not susceptible to 
inhibition to CLM-MG was performed by altering the API 

10 recognition sequence. A CLM-MG binding site was introduced 
immediately adjacent (3*) to the API recognition sequence 
ACTGTCA. Comparison between the wild type API sequence and the 
altered sequence shows that introduction of a CLM-MG binding site 
increases the sensitivity of APl-DNA complexes to CLM-MG. 

15 However, these changes also resulted in less binding by API and 
therefore the possibility that the increased sensitivity to CLM- 
MG is a result of a decreased protein-DNA affinity cannot be 
excluded. 

Transcriptional Assavs 
20 Finally, to determine whether the sequence-specific 

inhibition of DNA-protein interactions by CLM-MG observed in 
vitro could be manifest in vivo , the affect of CI«-MG on the 
expression of the secreted alkaline phosphatase reporter gene 
directed by multimerized NFAT, API, NFkB or Octl/OAP recognition 
25 sec[uences located upstream of the IL-2 minimal promoter was 
measured in transient transfection assays. 

Reporter constructs driving expression of an alkaline 
phosphatase gene under transcriptional control of cis-linked 
sequences were made for transient expression assays. NFAT-SX 
30 contains three copies of the NFAT recognition sequence (-286 to - 
257 of the human IL-2 enhancer) . APl-SX contains five copies of 
the htunan metallothionein promoter /enhancer AP-1 binding site 
(5'-TGACTCAGCGC-3«) . NFkB-SX contains three copies of the of the 
murine k light chain NFkB binding site. Octl/OAP-SX contains 
35 four copies of the Octl/OAP site (ARREl; -93 to -63 of the human. 
IL-2 enhancer) . 

TAg Jurkat cells were transiently transfected with 4 
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/ig of the indicated reporter; plasmid by electroporation in 
complete media (Bio-Rad GenePulser; 960 fiF, 250 V, 0.4 cm cuvette 
width) . Cells were harvested after approximately 24 hours and 
aliquoted into 96-well flat bottom microtitre plates (2 x 10^ 
5 cells per well in 100 Ml of complete medium) . Varying 
concentrations of CLM-MG or the control buffjer were added to 
duplicate wells in a 5 or 10 ^1 voliame. After 60 minutes at 
37«C, ionomycin and TPA were added in 100 ^1 complete medixim to 
final concentrations of 1 /xM and 20 ng/ml, respectively. 

10 Secreted aUcaline phosphatase activity was measured after 12-16 
hours as described in Clipstone N and Crabtree GR (1992) Nature 
3 57 : 695, incorporated herein by reference. 

Expression of each of these reporter constructs 
requires stimulation of the transfected cells by cross-linking 

15 the T cell antigen receptor or by treatment with ionomycin and/or 
TPA. CUf-K6 markedly inhibited the induced expression of the 
reporter gene driven by the NFAT-IL2 promoter with a half-maximal 
inhibitory concentration of 50-75 iM. In contrast, expression 
of the reporter gene driven by the AP1-IL2 promoter showed 

20 significantly less inhibition by CLM-MG, with a half-maximal 
inhibitory concentration of 225 mM, The NFkB-lL2 and Octl/OAP- 
IL2 promoters were inhibited to an intermediate level (Fig* 7). 
These results reflect in vivo the same specificity of inhibition 
of DNA-protein complex formation by CLM-MG that was observed in 

25 vitro and are consistent with CLM-MG inhibiting gene expression 
in vivo by inhibiting protein-DNA complex formation in a sequence 
specific manner. Although in these experiments CLM-MG was added 
to cells 60 minutes prior to stimulation, in other experiments 
significant inhibition was seen if CLM-MG was added only 5 

30 minutes prior to stimulation. This is consistent with the rapid 
rate in which CLM-MG binds DNA in vitro . Interestingly, addition 
of CLM-MG 15 minutes after stimulation resulted in minimal 
inhibition of reporter gene expression, indicating that once 
transcription is initiated either (i) enhancer binding proteins 

35 are no longer required for transcription, (ii) enhancer binding 
proteins are no longer susceptible to displacement by CLM-MG, 
(iii) CLM-MG acts by inhibiting the formation of DNA-protein 
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complexes but cannot displace pr formed c mplexes, or (Iv) 
alternate mechanisms are involved. 

Transcription factors typically form interactions with 
DNA along the major groove. CLH, in contrast, appeztrs to occupy 
5 the minor groove. Assuming the DNA-protein interactions that are 
sensitive to inhibition by CLM-M6 depend primarily on major 
groove contacts, the rapid displacement of preformed DNA*protein 
complexes by CLM-HG is unlikely to result from direct competition 
for a shared binding site. Rather, CLH-H6 binding in the minor 

10 groove may either induce a DNA conformation that is incompatible 
with protein binding along the major groove or render the 
specific region of DNA more rigid, thereby prohibiting DNA 
conformational changes that may be required for effective binding 
protein. This hypothetical binding interaction contrasts with 

15 that of the minor groove ligand netropsin, which binds triple 
helix DNA without displacing the major groove-bound third strand, 
or l-methylimidazole-2-carboxamide netropsin, which can bind 
simultaneously with the major groove-binding protein 6CN4(226- 
281} at a common binding site. Thus, targeting the minor groove, 

20 which appears to be accessible to small ligands in the presence 
of protein interacting along the major groove, may be a useful 
strategy in the design of DNA ligands capable of conferring 
sufficient conf omnational constraints upon DNA to antagonize DNA- 
protein interactions and thereby inhibit gene expression. As 

25 with most interactions between complex molecules, the specificity 
of the interaction between CLM-MG and DNA most likely results 
from contributions of specific hydrogen bond configurations 
substituting for the spine of hydration present in the minor 
groove of DNA and close van der Waals contacts between CLM-MG and 

30 appropriate constituents of DNA present along the floor and walls 
of the minor groove. Electrostatic interactions between CLM-MG 
and DNA likely contribute less to the specificity of the 
interaction due to the lack of charged groups in CLM-MG. Further 
understanding of the precise determinants of sequence-specific 

35 binding by CLM-MG and of the exact mechanism by which CLM-MG 
inhibits DNA-protein interactions and displaces preformed DNA- 
pr tein complex s can be determined by direct structural analyses 
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Of cut binding sequ nces and CLM-DNA complexes. The results 
presented here demonstrate that CLH-MG exhibits many of the 
properties required for biologically useful, sequence-specific 
transcriptional antagonists and suggest that CLM-H6 and other 
5 glycocon jugate DNA ligands may serve as lead compounds in the 
development of novel DNA-targeted therapeutic agents and biologic 
probes, among many other uses. 

Although the present invention has been described in 
some detail by way of illustration for purposes of clarity of 
10 understanding, it will be appeurent that certain changes and 
modifications may be practiced within the scope of the claims. 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SBQUENCS CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(Xl) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
AAGGAGGAAA AACT6TTTCA T 



(2) INFORMATION FOR SEQ ID NO: 2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GGGGCGGGGC 10 



(2) INFORMATION FOR SEQ ID NO: 3: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTGACTCAGC GCG 13 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 4: 
AAGAGGAAAA A 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 5: 
GAAA6GAGGA AAAACTGTTT 20 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CCAAAGAGGA AAATTTGTTT 20 
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(2) INFORMATION FOR S£Q ID N0:7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAGAAGAGGA AAAATGAAGG 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TCCAGGAGAA AAAATGCCTC 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AAAACTTGTG AAAATACGTA 
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(2) INFORmnOK FOR SEQ ID K0:10: 

(1) SEQUENCE CEARACTERISTICS: 

(A) LEK6TH: 20 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genonic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TAAAGOAGAG AACACCAGCT 

(2) INFORMATION FOR SEQ ID NO: 11: 

(1) SEQUENCE CaARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genonic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCAGGGT6GG AAAGGCCTTT 20 

(2) INFORMATION FOR SEQ ID NO: 12: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GATCTAAGGA GGAAAAACTG TTTCATG 27 
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(2) INFOIWATION FOR SEQ ID M0:13: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: single 

(D) TOPOLCXSY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 
GATCCATGAA ACAGTTTTTC CTCCTTA 27 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TCGAGTGACT CAGCGCG 17 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCGACGCGCT GAGTCAC 17 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AGGGATTTCA C 



(2) INFOR!tATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(O) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GTGAAATCCC T 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linesu: 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TGTAATATGT AAAACATTT 
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(2) INFORH^TIOH FOR SEQ ID MO: 19: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAAAATGTTT TACATATTA 



(2) INFORMATION FOR SEQ ID NO: 20: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(xl) SEQUENCE DESCRIPTION: SEQ ID N0:20: 
TTTGAAAATA GTGTAATATG TAAAACAT 



(2) INFORMATION FOR SEQ ID NO: 21: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AAAATGTTTT ACATATTACA CATATTTTC 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ONA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CAAACTGTCA AATATTAACT AAAGG6A6 



(2) INFORMATION FOR SEQ ID NO:23: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CTCCCTTTAG TTAATATTT6 ACAGTTTG 



(2) INFORMATION FOR SEQ ID NO: 24: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(O) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TCGAGGGGCG GGGC 
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(2) INFORMATION FOR SSQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCGAGCCCCG CCCC 



(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TGACTCAGCG C 



(2) INFORMATION FOR SEQ ID NO:27: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TGTAATATGT AAAACATTTT G 
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(2) INFORMATION FOR SSQ ID N0:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: UNA (genomic) 



(xl) SEQUENCE DESCRIPTION: SBQ ID NO: 28: 
GATCTTTGAA AATAT6TGTA ATATGTAAAA CATTTTG 



(2) INFORMATION FOR SEQ ID NO: 29: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(xl) SEQUENCE DESCRIPTION: SEQ ID NP:29: 
GAAGGAGGAA AAA 



(2) INFORMATION FOR SEQ ID NO:30: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GAAGTAGGAA AAA 
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(2) mFORMATION FOR SEQ ID NO: 31: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic add 

(C) STRANDBONESS: single 

(D) TOPOLOGY: linear 

(ii) KOLECOLE TYPE: DNA (genomic) 



(xl) SEQUENCE DESCRIPTION: SEQ ID N0:31: 
GAAG6AAGGA AAAA 
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1. A composition f a glycocon jugate DNA ligand lacking 
DNA strand cleavage activity, comprising a compound according t 
the structure 




5 wherein is selected from the group consisting of hydrogen, 
hydroxy, lower alley 1, lower alkoxy, aryloxy, and alkanol; and Ra 
is selected from the group consisting of hydrogen, lower alkyl, 
lower alkoxy, iodo, bromo, fluoro, and chloro. 

10 2« A composition comprising a glycocon jugate DNA ligand 

of claim 1, wherein R^^ is selected from the group consisting of 
methoxy, ethoxy, propyloxy, methyl, ethyl, and hydrogen. 

3. A composition of claim 2, wherein R2 is iodo, bromo, 
15 chloro, or fluoro. 

4. A composition of claim 1, wherein the glycoconjugate 
DNA ligand binds to DNA in a sequence-selective manner with 
preferential binding to the tetranucleotide sequences TCCT and 

20 TCTC. 

5. A composition of claim 4, wherein the glycoconjugate 
DNA ligand preferentially binds to an NFAT recognition sequence 
as compared to an API or Spl recognition sequence. 

25 

6. . A composition of claim 5, wherein the glycoconjugate 
DNA ligand inhibits the formation of an NFAT-DNA complex between 
NFAT and an NFAT recognition sequence or displaces NFAT from a 
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preformed NFAT-DNA compl x. 

7. A composlti n of claim 6, wherein the glyc c njugate 
DNA llgand is calicheamicin MG 

5 

8. A glycqcon jugate DNA ligand having the structxire: 




9. A glycoconjugate DNA ligand of claim 8 which is labeled 
with a radioisotope or spin label. 

10 

10. A glycoconjugate DNA ligand of claim 8 noncovalently 
bound to a polynucleotide comprising a NFAT recognition . site 
sequence forming a glycoconjugate-liganded NFAT recognition site. 

15 11. A glycoconjugate-liganded NFAT recognition site 

comprising calicheamicin H6 bound to a polynucleotide comprising 
a NFAT recognition site sequence. 

.12. A glycoconjugate-liganded NFAT recognition site of 
20 claim 11, wherein the polynucleotide is present in a mammalian 
cell. 

13. A glycoconjugate-liganded NFAT recognition site of 
claim 12, wherein the polynucleotide present in said mammalian 

25 cell is replicable. 

14. A method for modulating the transcriptional activity 
of polynucleotide sequences under the transcriptional influence 
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of an perably linked cls-actlng sequence t which a DNA-blndlng 
pro'teln binds, comprising the step of: 

administering a sequence-selective glycocon jugate DNA ligand 
to a cell or to an In vitro transcription reaction wherein the 
5 glycocon jugate DNA ligand binds to said operably linked cis* 
acting sequence and alters binding of said DNA~binding protein 
to said operably linked cis-acting sequence, thereby modulating 
transcription of the polynucleotides sequence under the 
transcriptional influence of said operably linked cis-acting 
10 sequence. 

15 . A method of claim 14 , wherein the operably linked cis- 
acting sequence comprises a transcription factor recognition site 
and the DNA-binding protein is a treuiscription factor. 

15 

16. A method of claim 15, wherein the transcription factor 
recognition site is an NF-AT recognition site and the DNA-binding 
protein comprises NF«*AT« 

20 i7. A method of claim 16, wherein the NF-AT recognition 

site comprises the polypeptide sequence spanning -286 to -257 
^ upstream of the humetn IL-2 gene. 

18. A method for producing immunosuppression in a human 
25 patient, comprising administering a prophylactically or 

therapeutically effective dose of a glycocon jugate DNA ligamd 
that selectively binds to a NF-AT recognition site and inhibits 
NF-AT-dependent transcription, thereby modulating expression of 
a NF-AT-dependent gene in a T lymphocyte and inhibiting T 
30 lymphocyte activation. 

19. A method for identifying a sequence-selective 
glycocon jugate DNA ligand, comprising the steps of: 

administering to a binding reaction comprising a 
35 predetermined DNA-binding protein euid a polynucleotide comprising 
a recognition site for said predetermined DNA-binding protein a 
glycocon jugate DNA ligand having a structure according to the 
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formula: 




CH30 g 



where is hydrogen, halogen, hydroxy, lower allcyl, lower 
alkoxy, or aryloxy; is a halogen, hydroxy, or lower alkyl; R3 
is lower allcyl, lower aUcoxy, or hydrogen; and R4 and R5 are 
5 independently selected from lower alkyl, lower alkoxy, or 
hydrogen; and 

detecting the ability of the glycocon jugate DNA ligand 
to inhibit binding of the predetermined DNA-binding protein to 
the recognition site, and; 
10 identifying a glycoconjugate DNA ligand which inhibits 

said binding of said predetermined DNA-binding protein to said 
recognition sequence as a sequence-selective glycoconjugate DNA 
ligand. 

15 20. A method of claim 19, wherein at least one alternative 

DNA-binding protein and a polynucleotide comprising a recognition 
site to which the alternative DNA-binding protein binds are 
included in the binding reaction and wherein the step of 
detecting comprises determining the differential inhibition of 

20 binding of said predetermined DNA-binding protein to its 
recognition site as compared to an inhibition of binding of said 
alternative DNA-binding protein to its recognition site. 

21. A method for identifying glycoconjugate DNA ligands 
25 which are differential transcriptional agonists or antagonists, 
comprising the steps of: 

administering to a transcription assay comprising a 
reporter polynucleotide sequence under the transcriptional 
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Influence of a operably linked cis-acting recognition site for 
a predetermined transcription fact r a glycoc njugate DMA ligand 
having a structure according to the formula: 




CHgO g 

Where is hydrogen, halogen, hydroxy, lower alkyl, lower 
5 alkoxy, or aryloxy; is a halogen, hydroxy, or lower alkyl; R3 
is lower alkyl, lower alkoxy, or hydrogen; and R4 and R5 are 
independently selected from lower alkyl, lower alkoxy, or 
hydrogen; and 

detecting the ability of the glycocon jugate DNA ligand 
10 to modulate transcription of the reporter polnucleotide sequence 
dependent on the operably linked cis--acting transcription factor 
recognition secpience and predetermined transcription factor as 
compared to a control transcription reaction lacking the 
glycocon jugate DNA ligand, and; 
15 identifying a glycocon jugate DNA ligand which inhibits 

the predetermined tramscription factor-dependent transcription 
of said reporter polynucleotide sequence. 

22. A method of claim 21, wherein the transcription 
20 assay comprises a mammalian cell comprising a predetermined 
trancription factor and a reporter polynucleotide operably linked 
to a cis-*acting transcription factor recognition sequence bound 
by said predetermined transcription factor. 

25 23. A method of claim 22, wherein the mammalian cell 

is transfected with NFAT-SX, APl-SX, NFkB-SX, or Octl-SX. 
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Aaggoggaaooo CT6TTTCAT 

.6T6ACTCAGC6C6 

ogggATTTCAC 

T6TAATAT6ToaooCAtttt6 

6ATCTTT6ooooTAT6TGTAATAT6ToaooCAtttt6 

CAAACTGTCAAATATTAACTAAogggAG 
6666C6666C 
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